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Preface to the Revised Rdition 


In keeping with the rapid growth of differential psychology during 
tJie past decade, the present edition represents a thorough revision and 
a considerable enlargement of the original book. Four new chapters 
have been added, covering basic concepts of psychological testing 
(Ch. 2), biological and psychological factors in simple behavior de- 
velopment (Ch, 5, 6) and the effects of schooling upon intelligence 
(Ch. 8). The inclusion of the new Chapters 5 and 6 reflects a greater 
emphasis upon the developmental approach in the study of behavior 
differences, an approach also illustrated by the increasing number of 
longitudinal studies reported throughout the book. Much more mate- 
rial is now available on individual and group differences in person- 
ality characteristics. These findings have been discussed, together 
with the results on intelleclual differences, in the appropriate sections. 
The content of the chapters on trait organization (Ch. 15) and socio- 
economic differences (Ch. 23) is substantially new, most of the studies 
covered having been conducted during the last ten years. The remain- 
ing chapters have likewise been reorganized and rewritten in order to 
incorporate and integrate recent developments in each area. The 
present edition has also di,twn more extensively upon recent findings 
in genetics, anthropology, and sociology. 

Partly as a result of protracted vontroversics in certain areas, the 
past decade has witnessed an increasing methodol<')gical and theoreti- 
cal sophistication among workers in differential psychology. More 
rigid standards of experimental control have been demanded, tradi- 
tional procedures have been scrutinized and challenged, and a greater 
concern for sharply defined concepts has been evidenced. In recogni- 
tion of these developments, the prc'ent book places more emphasis 
upon methodological problems and basic concepts. This is especially 
illustrated in the reorganized Part I, whose aim is to give the student 
a brief over-all introduction to certain important concepts of psycho- 
logical testing, heredity and environment, and the nature of individ- 
ual differences, prior to the presentation of specific data in Parts II 
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viii Preface to the Revised Edition 

and III. Even in the subsequent treatment of individual or group dif- 
ferences, however, the obtained findings are always interpreted oper- 
ationally in the light of experimental conditions and methodology. 

While keeping abreast of the many changes in the field, the present 
book retains the fundamental objectives of the original edition. Firs^, 
differential psychology is presented, not as a separate field of psy- 
chology, but as one approach to the understanding of behavior. Its 
fundamental questions are no different from tliose of general psychol- 
ogy. It is apparent that if we can explain why individuals react differ- 
ently from one another, we shall understand why each individual reacts 
as he does. The data of differential psychology should thus help to 
clarify the basic mechanisms of behavior. It is primarily from this 
point of view that the problems of individual and group differences 
are surveyed in the present text. 

A second aim of the book has been to coordinate the various topics 
which have usually been joined together loosely under the caption of 
“individual differences,"’ The phenomenal development of differential 
psychology during the past decade has resulted in an increasing spe- 
cialization of interest among research workers and a frequent disregard 
of the broader implications of the data. The mulual interreliilion of 
the various problems has often been obscured by the accumulation of 
data at a more rapid pace than they could conilorlably be assimilated. 
For this reason, the writers have endeavored to bear constantly in 
mind Ihc interrelationships among different types of investigations 
and have attempted to present a s)sleniatic organization and integra- 
tion of the material. No chapter stands alone. Each is related to what 
preceded it and to what is to follow. 

Thirdly, it has been our aim to report the major findings of differ- 
ential psychology in a h^rm teadilv coniprchensiblc to the collate stu- 
dent. Our purpose has been to present the material clearly and inter- 
estingly, at the same time avoiding the errors of significant omission 
and falsification which so frequently characterize many attempts at 
popularization. Since much of the knowledge in this field is found only 
in highly technical sources, certain topics have customarily received 
cursory or superficial treatment in texts on individual differences. Even 
the more advanced student of psychology who has specialized in other 
phases of the subject often finds it impossible to keep informed on 
current developments in certain branches of differential psychology. 
The writers are convinced, however, that a non-technical and easily 
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comprehensible presentation of such topics is both feasible and desir- 
able. An understanding of the basic concepts and major findings within 
any area need not be limited to those who have mastered its special- 
ized techniques. 

* One further point deserves special mention. The present book is 
not intended to be a lileraturc survey. First and foremost, it is a 
hook designed to develop in the student the intellectual skills needed 
for understanding and evaluating the data of differential psychology. 
Throughout the book, special emphasis has been placed upon the 
examination of common pitfalls and sources of error in the interpreta- 
tion of obtained results. We have thus hoped to provide the student 
'^ith certain tools M'hcrehv he may evaluate for himself a set of data 
with whuh he is confronted. This would seem to be iar more impor- 
tant than the mere presentation of a body oC facts. The development 
of critical ability and of a dispassionate and objective attitude toward 
human behavior is more urgently needed today than ever belorc. 

The writers arc pleased to acknowledge the cooperation of several 
colleagues in the preparation of this revision. Professors Robert T. 
Rock, Jr., and Dorothea McCarthy of the Department of Psychology, 
Fordham University, contributed many valuable suggestions and pro- 
vided assistance in countless other ways. The writers are indebted to 
Professor Robert L. Thorndike of Teachers College, Columbia Univer- 
sity, for his intensive reading of the chapter on Schooling; and Intelli- 
^ence and for his constructive comments in this area. Thanks arc ex- 
tended to Professor Charles A. Berger, Chairman of the Department 
of Biology, Fordham University, for his critical reading of the sections 
dealing with genetics. The writers wish to express their appreciation 
to Professor Frank Lee of the Scho4>l of Engineering, Columbia Uni- 
versity, for his skillful and painstaking preparation of the illustrations. 
Finally, grateful acknowledgment is made to Mrs. Enrica Tunnell, 
Librarian, Psychology Reading Room, Columbia University, for her 
ready aid in many bibliographic matters. 


A. A. 

J. P. F. 
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Historical Introduction 


CHAPTER 

1 


Man has always been aware of differences among his fellow beings. 
He has, to be sure, entertained various theories, beliefs, or supersti- 
lions regarding the causes of such differences, and has interpreted them 
differently according to his ow^n traditional background, but he has at 
all times accepted the fact of their existence. Among primitive peo- 
ples, unusual deviations in behavior are clearly recognized. Thus 
many primitive groups acknowledge exceptional artistic talent among 
their members and encourage the development of specialized artists. 
The presence of hysterical or cpileptoid symptoms, paranoid trends, 
and similar peculiarities of behavior has frequently been regarded as 
an index of religious or magical powers and has been treated accord- 
ingly. At any level of cultural development, specialization of labor 
itself implies a tacit assumption of differences among people. 

Nor is this response to individual differences limited to the human 
species. Instances from infrahimian behavior can readily be found. 
The acceptance of certain individuals as “leaders” by herds of ele- 
phants, buffaloes, and similar gregarious animals has been widely dis- 
cussed in the literature both of fact and of fiction. In communities of 
baboons, a certain member is posted as “sentinel” to watch for the 
approach of danger and warn the others by conventional cries. The 
frequently described “hacking” or hen-pecking behavior of chickens 
is another case in point. A definite relationship of social domination 
is often displayed by chickens in the barnyard, this fighting or “hack- 
ing” behavior usually centering about the acquisition of food. In such 
cases, A will attack B, although the reverse will not occur. Violent 
conflicts often ensue when the authority of the chief “hacker'’ in the 
group is disputed. These and many other examples illustrate the preva- 
lence of dilTercntial responses to individuals within one’s own group. 

The objective and quantitative investigation of individual differences 
in behavior phenomena is the domain of differential psychology. What 
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4 Differential Psychology 

is the nature and extent of such differences? What can be discovered 
as to their causes? How are the differences affected by training, 
growth, physical conditions? In what manner are the differences in 
various traits related to one another, or organized? These are some of 
the fundamental questions raised by differential psychology. 

Part I of the present book furnfshes a brief histoikal ancJ methodo- 
logical oiientatkm to the field of differential psychology, including an 
introduction to its fundamental concepts. Among the concepts treated 
are those underlining psychological testing, the distribution of indi- 
vidual differences, and heredity and environment. The analysis of 
specific data on individual differences forms the content of Part IT, 
this analysis being developed in terms of the effect of different factors 
upon behavior development. Thus the contributions of such factors 
as training, age, familial relationships, and structural coi relates are 
considered in turn, followed by a discussion of the nature and inter- 
relationships of psychological traits. 

Differential psychology is al o concerned with an analysis of the 
nature and characretisiks of major traditional cjfoupim^s, such as the 
subnormal and the genius, the sexes, and racial, national, and cul- 
tural diMsions. This furnishes the subject matter of Pait HI. The study 
of such gioup differences serves a threefold purpose. In the first place, 
one cannot ignore the fact that such groupings are being made in the 
practical Tca^m of everyday life. These distinctions cannot be swept 
away casually on the grounds that, perhaps, the study of individual 
differences reveals no need for them or for any sharp divisions into 
clear-cut categoiies. Certain groups are recognized and tesponded to 
as distinctive in our present-day society. For a purely practical reason, 
therefore, these groups must be investigated, in the hope that the 
specific findings may throw some light upon their nature and possibly 
further a more intelligent practical understanding of them. 

Secondly, the comparative investigation of different groups should 
help to clarify the fundamental problems of individual differences in 
general. In such groups we can see the principles of individual dif- 
ferences in operation and can note their effects. Group differences in 
behavior, when considered in conjunction with other concomitant 
differences among the groups, furnish an excellent available means of 
analyzing the causes of variability. 

Thirdly, the comparison of a psychological phenomenon as it 
occurs in different groups may contribute toward a clearer under- 
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standing of the phenomenon itself. The findings of general psychology, 
when tested in widely varying groups, are sometimes found to be not 
so general as was supposed. To study a phenomenon in all its varied 
manifestations is to have a better grasp of its essential nature. 

• Notwithstanding the early and widespread recognition of individual 
differences in the practical adjustments of everyday life, the system- 
atic investigation of such differences is a relatively recent develop- 
ment in psychology. We may therefore begin by considering the 
principal historical developments in the field of differential psychology. 


INDIVIDUAL DIFFERENCES IN PRE-EXPERIMENTAL 
PSYCHOLOGICAL THEORY ^ 

One of the earliest iiistances of the explicit recognition of individual 
differences is to be found in the Republic of Plato. A fundamental aim 
of Plato’s ideal state is, in fact, the assignment of individuals to the 
special tasks for which they arc suited. In Book II of the Republic 
appears the following statement: “Really, 1 said, it is not improbable; 
for I recollect myself, after your answer, that, in the first place, no 
two persons are born exactly alike, but each differs from each in nat- 
ural endowments, one being suited for one occupation and another 
for another” (11, p. 60). Plato proposes a series of “actions to per- 
form” for use as tests of military aptitude on those who are to be the 
soldiers of his ideal state. These actions are designed to sample the 
various traits considered c .scniial to military prowess, and represent 
the first systematic description of an aptitude test on record. 

Nor did the versatile genius of Aristotle overlook individual varia- 
tion. He discusses at some length group differences, including species, 
racial, social, and sex differences in mental and moral traits. In many 
of his works there is also an implicit assumption of individual differ- 
ences, although it is interesting to note that Aristotle does not offer 
any extensive treatment of these differences as such. One gets the 
impression that he regards the existence of individual variation as too 

^ To supplement the brief historical sketch of the study of individual differences 
given in the present and following sections, the reader is referred to any of the 
standard works on the history of psychology, such as Boring (5), Murphy (31), and 
Rand (37). 

The numbcis in parentheses here and throimhout the book refer to the num- 
bered References at the end of the respective chapter. 
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obvious to need special mention. That he attributes such differences 
at least in part to innate factors seems to be indicated by a number 
of statements such as the following: 

Perhaps, then, some one may say, “Since it is in my power to be just 
and good, if I wish I shall be the best of all men.” This, of course, is not 
possible. . . . For he who v/ills to be best will not be so, unless Nature 
also be presupposed (38, Magna Moralia, 1 187*0- 

Throughout the several Ethics of Aristotle, there appear passages 
which imply individual variation. The following statement, for exam- 
ple, leaves little doubt regarding Aristotle’s position on this point: 

After these distinctions we must notice that in everything continuous 
and divisible there is excess, deficiency, and the mean, and these in rela- 
tion to one another or in relation to us, c.g., in the gymnastic or medical 
arts, and in those of building and navigation, and in any sort of action, 
alike scientific and non-scientific, skilled and unskilled. For motion is con- 
tinuous, and action is motion (38, Ethica Eiidcniia, 1220''). 

Aristotle then proceeds to describe the characteristics of men possess- 
ing an excess or a deficient amount of various traits such a^ irasci- 
bility, audacity, shamelessness, and others. 

In the Scholasticism of the Middle Ages, individual differences were 
largely neglected. Philosophical generalizations regarding the nature 
of the mind were formulated thr6Ligh “rational,” or speculative, rather 
than “empirical” means. 1 he observation of individuals thus played 
little or no part in the development of such doctrines. Of particular 
interest for differential psychology is the “faculty psychology” ad- 
vanced by St. Augustine and Thomas Aquinas. Such “faculties” as 
“memory,” “imagination,” and “will” have been regarded by some as 
the precursors of the traits and factors currently identified through 
statistical analysis of test scores. These empirically determined factors, 
however, differ in a number of important ways from the rationally 
derived faculties of Scholastic philosophy (cf. Ch. 15). 

The many varieties of A';sociationism which flourished from the 
seventeenth to the nineteenth centuries likewise took little heed of 
individual differences. It was with the elaborate mechanics whereby 
“ideas” become associated, giving rise to complex mental processes, 
that the associationists were primarily concerned. Their statements 
were general principles with no allowance for individual variation. 
Bain, the last of the so-called pure associationists, docs, however, give 
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some attention to individual differences in his writings. The following 
passage is taken from his book on The Senses and the Intellect (1855 ) : 
“There is a natural force of adhesiveness, specific to each constitution, 
and distinguishing one individual from another. This property, like 
Almost every other assignable property of human nature, I consider to 
be unequally distributed” (2, p. 237). 

A simultaneous development in educational theory should prob- 
ably be included at this point. Tn the writings and practices of a group 
of “naturalist” educators of the late eighteenth and early nineteenth 
centuries, including Rousseau, Pestalozzi, Herbart, and Froebel, there 
is found a definite shift of interest to the individual child fcf. 29). 
Lducational policies and methods were to be determined, not by 
external criteria, but by direct observation of the child and his capaci- 
ties. The emphasis still seemed to be, however, on the observation of 
the individual as representative of individuals in general, rather than 
as distinct from other individuals. Although statements can be found 
in the writings of these educators to the effect that individuals differ 
and that their education should be adapted to these differences, still 
the emphasis is laid more heavily upon free, “natural” education in 
contrast to externally and arbitrarily imposed procedures, rather than 
upon individual differences themselves. 'I'he term “individual” is often 
used to mean simply “human nature.” 

Finally, mention may be made of the various treatises on race and 
racial psychology which appeared in the late eighteenth and early 
nineteenth centuries. Discussions of race differences are to be found 
in the works of such writeis as Buffon, Herder, and de Gobineau, the 
last having been especially influential in determining subsequent 
popular beliefs about race. 

THE PERSONAL EQUATION TN ASTRONOMY 

Curiously enough, the first systematic measurements of individual dif- 
ferences were undertaken not in psychology but in the old and time- 
honored science of astronomy. In i796, Maskelyne, the astronomer 
royal at the Greenwich Observatory^ dismissed Kinnebrook, his assist- 
ant, because the latter observed the times of stellar transits nearly a 
second later than he did. The method employed at the time to make 
such observations was the “eye and ear” method. This method in- 
volved not only coordination of visual and auditory impressions, but 
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also rather complex spatial judgments. The observer noted the time 
to a second on the clock, then began to count seconds with the heard 
beats of the clock, at the same time watching the star as it crossed 
the field of the telescope. He noted the position of the star at the last 
beat of the clock just before it reached the “critical” line in the fielcl; 
then, similarly, he noted its position with the first beat immediately 
after it had crossed that line. Fiom these observations, an estimate 
was made in tenths of a second of the exact time when the star 
crossed the critical line. This was the accepted procedure and was 
regarded as accurate to one- or two-tenths of a second. 

In 1816 Bessel, astronomer at Konigsberg, read of the Kinnebrook 
incident in a history of the Greenwich Astronomical Observatory, 
and became interested in measuring what came to be known as the 
‘'personal equation” of difierent observers. Originally, the personal 
equation referred to the difTcience in seconds between the estimates 
of two observers. Bessel collected and published data on several 
trained observers, and pointed out not only the presence of such a 
personal equation or error when compaiing any two observers, but 
also the variability in the equation from time to time. This represents 
the first published recoid of quantitative data on individual differences. 

Many astronomers followed up Bessel’s measurements. In the latter 
half of the nineteenth century, with the introduction of chronographs 
and chronoscopcs, it became possible to measure the personal equation 
of a given observer without reference to any other observer. The 
attempt was made to reduce all observations to their objectively cor- 
rect values without rctcrcnce to a system of time based upon one 
observer as a standard. Astronomers also undertook an analysis of 
the various conditions which affected the size of the personal equation. 
It was this latter problem, rather than the measurement of individual 
differences, w'hich was taken up by the early experimental psycholo- 
gists in their studies of "‘reaction time.” 

THE RISE OF FXPFRIMENTAL PSYCHOLOGY 

During the latter half of the nineteenth century, psychology began to 
venture away from its armchair and enter the laboratory. Most of the 
early experimental psychologists were physiologists whose experiments 
gradually came to take on a psychological tinge. As a result, both the 
viewpoints and the methods of physiology were frequently carried 
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over directly into the infant science of psychology. In 1879, Wilhelm 
Wundt established the first laboratory of experimental psychology at 
Leipzig. Experiments of a psychological nature had been performed 
previously by Weber, Fcchner, Helmholtz, and others, but Wundt’s 
laboratory was the first to be devoted exclusively to psychology and to 
offer facilities for training students in the methods of the new science. 
Students from many parts of the world were attracted to Wundt’s 
laboratory, and upon their return founded laboratories in their own 
countries. 

The problems investigated in these early laboratories testify to the 
close kinship of experimental psychology with physiology. The study 
of visual and auditory sensation, reaction time, psychophysics, and 
association constituted nearly the entire field of experimentation. It 
was characteristic of the early experimental psychologists either to 
ignore individual difL fences or to regard them simply in the nature of 
experimental “errors.” 1 he greater the individual variation in a phe- 
nomenon, the less accurate would be the generalizations regarding 
such a phenomenon. The extent of individual diflercnces would thus 
represent the “probable error” to be expected in the application of 
the general formuiations. 

It is appiirent that the ri^c ol cxpciimcntal psychology shifted the 
emphasis away from the study of individual differences rather than 
toward it. Its one contribution to the development of a differential 
psychology is to be found in its demonstration that psychological 
phenomena are amenable to objective and even quantitative investiga- 
tion, that psychological th orio c«in be tested by actual data, that 
psychology, in short, could become an empirical science. Such a step 
was required before theories about the individual could be replaced by 
studies on individual differences. 

GAT TON AND TIIL UfOIOGlCAl INFIUTNCT 

With the spread of Darw^inism in the late nineteenth century, psychol- 
ogy became increasingly biological in its approach. One of the most 
widely known of Darwin’s followers v as Sir Francis Gallon, who first 
attempted to apply the evolutionary principles of variation, selection, 
and adaptation to the study of human individuals. Galton’s scientific 
pursuits were many and varied, but they were unified by an under- 
lying interest in the study of heredity. The science of eugenics, whose 
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aim IS the control and direction of human evolution, was originated 
by Galton. In 1869, he published a book entitled Hereditary Genius, 
in \\hich, by the application of the now well-known family history 
method, he tried to demonstrate the inheritance of specific talents in 
various hclds of work. In connection with the study of human inhcrfi- 
ance, it soon became apparent that related and unrelated individuals 
must be measured, objectively and in large numbeis, in order to dis- 
co\er the degrees of resemblance among them For this purpose, 
Galton dcMscd numerous tests and measure and in 18S2 established 
his famous anthropometric laboratoi^ at South Kensington Museum 
in London There, for the p.i>mcnt of a small fee, individuals could 
be tested in sensory disciimination, motor capacities, and other simple 
processes 

Ihiough the measurement of sensory processes, Galton hoped to 
arrive at an estimate ol the subiecfs intellectiia’ level In the Inqiiuics 
into Human Faailu a collection of miscellaneous cssavs published 
in 1883, he wrote “The onl> intoimation that icaches us concerning 
outw ird events appears to pass tlirough the avenue ol our senses, and 
the moie pciccptive the senses are of difference, the larger is the held 
upon which our iiulgruviit and intelligence can act’' ( M, p Tl) And 
again on the basis ol imdings on the interior sensitivit> of idiots, he 
observes that scnsor\ discriminative capacity “would on the whole be 
liighcst among the intelleetuallv ablest’' ( 13, p 29) For this reason, 
measures o( sensory capacitv, such as \ision and hearing, constituted 
a relatively large portion of the tests which Galton constructed and 
employed \mon^» these tests rnav be mcntionecf the Gallon bar lor 
visual discrimination of length, the Galton whistle for the determina- 
tion of the higticst audible pitch, kimcsthetic disci immation tests based 
on the ai rangement ot a senes ol weights as well as tests of strength 
of movement, speed of simple reactions and many others of a similar 
nature 

Galton also initiated the use of ‘ free association” tests, a technique 
which was subsequently adopted and further developed by Wundt 
Gabon’s study of mental imagery is well known and represents the 
first extensive psychological use ol questionnaire methods In this 
questionnaire, the subject was directed “to think of some definite 
object — suppo‘^e it is your breakfast table as you sat down to it this 
morning— and consider carefully the picture that rises before your 
mind’s eye” ( 13, p 84) They were then to describe the picture with 
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reference to illumination, definition, and coloring. Wide individual 
and group differences were revealed by this analysis of imagery. 

A further and very significant contribution of Galton to differential 
psychology was his development of statistical methods for the analysis 
of the data of individual differences. Formerly, statistics had been 
chiefly the tool of the trained mathematician and the professional 
gambler. Statistical techniques were not available in a form which 
would enable the mathematically untrained worker in the biological 
sciences to employ them. Cialton realized the need for such techniques 
and developed many of the statistical procedures in current use today. 
This phase of his woik has been extended and increased in scope by 
many eminent students, chiel among whom is Karl Pearson, who suc- 
ceeded Galton as director of the anthropometric laboratory in 1911. 

LARIY I XPLRIMLNI AlION WH H ThSTS 

The term “mental tc^t” was first employed in 1890 by Cattell, in an 
article entitled Mental Tests and Mea^ntcmcnis (9). James McKeen 
Cattell was an American student ol Wundt. In 1888, having obtained 
his doctorate at Leipzig, he returned to this country, where he was 
instrumental both in the spread of experimental psychology and in 
the development of mental testing. He had also come under the influ- 
ence of Galton'’s work in test construction and statistics. Thus in 
Cattell we find a convergence ot iwo contemporary movements in 
psychology: the rise of the experimental method and the measurement 
of individual dilferenccs. i wa^ characteristic of all the early Ameri- 
can mental tests that they de\ eloped in the psychological laboratory 
and partook of the nature of the ex^ erimcntal psychology of the time. 
This w'as not true of man> oi the l:sts developed in other countries. 

In additam to his experiments on reaction time, attention span, 
controlled association, leading, ps>choph>sics, and similar problems, 
Cattell constiuctcd a series of tests which were administered for many 
years to freshmen and seniors at Columbia College. This series in- 
cluded the following tests: - (1) ^rrongth of giip; (21 rate of arm 
movement; (3) two-point threshold on the back of the hand; (4) 
amount of pressure required to produce pain on the forehead; (5) 
least noticeable dilTcrencc in weights, (6) reaction time to sound; 
(7) speed of color naming; (8) bisection of a 50-cm. line; (9) repro- 

^ Tor a fullei cR'sciiplion, ci Cattcl and Fan and (10). 
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duction of a 10-sccond lime interval; (10) auditory memory span for 
letters. This list is typical of the various test series which appeared at 
the lime. 

In 1891, Munstcrberg (30) described a series of tests which he 
had employed on school children. Tests of reading, controlled associa- 
tion of various sorts, judgments memory, and other simple mental 
processes were included. At the 1893 Columbian Exposition at Chi- 
cago, Jastrow administered a series of sensory, motor, and simple 
perceptual tests to all persons interested. Norms of physical growth 
and mental development were presented with the tests (cf. 34, 35). 

What is probably the first attempt to evaluate test scores in terms 
of an independent criterion is to be found in the study by Bolton (4) 
reported in 1892. Bolton analyzed data collected by Boas on about 
1500 school children. The children’s memory spans were compared 
with their teachers’ estimates of “intellectual acuteness,” very little 
correspondence being found. Gilbert (16), in 1893, compared 
teachers’ estimates of “general ability” on some 1200 children with 
their scores on eight tests of sensory and motor functions, reaction 
time, sensory memory, and suggestibility. Three years Inter, Gilbert 
(17) described some additional tests and reported the lesults obtained 
with them on several hundred children. The data were analyzed in 
respect to sex differences, intellectual growth, and the relationship of 
mental and physical dcvelopmtjnt. 

In Germany, Ochrn (33), a pupil of Kraepclin, published in 1889 
the results of an intensive study of a scries of tests on ten subjects. 
The tests had been rather arbitrarily selected to measure perception, 
memory, association, and motor functions. In 1895, Kraepelin (28) 
proposed a set of traits which he regarded as basic in the characteriza- 
tion ol any individual. He also devised tests for the measurement of 
these traits, most of the tests involving simple arithmetic operations. 
These tests were of rather dubious validity for measuring the traits 
in question, and in addition they were quite impracticable, some of 
them requiring several days for their completion. 

Research on mental tests was also being conducted simultaneously 
under the direction of the Italian psychologist Ferrari. In an article 
appearing in 1896, some of these tests were described (20). They 
included measures of vasomotor activity, motor strength and skill, 
range of apprehension, description of pictures, and temporal estima- 
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tion. Interesting individual differences were reported in many of these 
tests. 


BEGINNINGS OF DIFFERENTIAL PSYCHOLOGY 

A1* the turn of the century, differential psychology had begun to 
assume definite shape. In 18^5 Binet and Henri published an article 
entitled La psydiolo^ie indixiduelle (3), which represents the first 
systematic analysis of the aims, scope, and methods of differential 
psychology. Their opening sentence suggests the status of this branch 
of psychology at the time. It reads: “We broach here a new subject, 
difficult and as yet very mcagerly explored” (p. 41 1 ) . Binet and Henri 
put forth as the two major problems of differential psychology, first, 
the stud) of the nature and extent of individual differences in psy- 
chological processes; and secondly, the discovery of the interrelation- 
ships of mental procciscs within the individual, so that we may arrive 
at a classification of traits and determine which arc the more basic 
functions. 

In 1900 appeared the first edition of vStern’s book on differential 
psychology, under the title Ober Psychologic der individucllen Differ- 
enzen (42). Part I deals with the nature, problems, and methods of 
differential psychology. Within the scope of this branch of psychology 
Stern included differences among individuals as well as among racial 
and cultural groups, occupational and social levels, and the two sexes. 
The fundamental problem of differential psychology he characterized 
as threefold: (1 ) What is the nature and extent of differences in the 
psychological life of indiviuuals and gioups? (2) What factors deter- 
mine or affect these differences? In this connection he mentioned 
heredity, climate, social or cultura* level, training, adaptation, etc. 
(3) How arc the differences manifested'^ Can they be detected by 
such indices as handwiiting, facial conformation, etc.? Stern also 
included a discussion of the concepts of psychological type, individ- 
uality, and normality and abnormality. Under the methods of differ- 
ential psychology, he gave an evaluation of introspection, objective 
observation, the use of material fron^ history and poetry, the study of 
culture, quantitative testing, and experiment. Part U contains a gen- 
eral discussion and some data on individual differences in various 
psychological traits, from simple sensory capacities to more complex 
mental processes and emotional characteristics. Stern’s book appeared 
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in a highly revised and enlarged edition in 1911, and again in 1921, 
under the title of Die Differcntielle Psychologie in ihren rnethodischen 
Grundlagcn (43). 

In America committees were being appointed to investigate testing 
methods and to sponsor the accumulation of data on individual differ- 
ences. At its 1895 meeting, the •American Psychological Association 
appointed a committee ''to consider the feasibility of cooperation 
among the various psychological laboratories in the collection of 
mental and physical statistics" (10, p. 619). In the following year, 
the American Association for the Advancement of Science established 
a standing committee to organize an ethnographic survey of the white 
population in the United Slates. Cattcll one of the members of this 
committee, pointed out the importance of including psychological 
tests in this survey and suggested that its work be coordinated with 
that proposed by the American Psychological Association (10, pp. 
619-620). 

1 he application of the newly devised mental tests to various groups 
was also getting under way, R. L. Kelly (25 ) in 1903 and Norsworthy 
(32) in 1906 compared normal and feebleminded children on sensori- 
motor and simple mental tests, and called attention to the continuous 
gradation in ability which exists between these groups, the feeble- 
minded not constituting a distinct category. In 1903 appeared Thomp- 
son's The Menial Trails of Sex (47), the result of several years’ test- 
ing of men and women with a variety of tests. This represents the first 
comprehensive investigation on psychological sex differences! Tests of 
sensory acuity, motor capacities, and a few simple mental processes 
were also being administered for the first time to various racial groups. 
A few scattered investigations appeared before 1900. In 1904 Wood- 
worth (50) and Bruner (7) tested several primitive groups at the 
St. Louis Exposition. In the same year appeared Spearman’s original 
article putting forth his Two-Factor theory of mental organization 
and introducing a statistical technique for investigating the problem. 
Thus, shortly after the beginning of the century, the foundations had 
been laid for virtually every branch of differential psychology, 

INTELLTGRNCH TESTING 

The intelligence test is a product of the twentieth century. The early 
mental tests were predominantly sensori-motor or very simple in 
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nature. This was no doubt a carry-over from the sensationism current 
in the psychological laboratories of the time. Complex mental proc- 
esses were believed to be best understood by analyzing them into their 
elementary components, usually of a sensory nature. Most of the 
efforts of the early experimentalists were therefore devoted to the 
study of simple sensori-motor reactions, and this influence left its mark 
on the newly developing mental tests. 

Binet and Henri, in their 1895 article (3), were the first definitely 
to point out the need for more complex tests to measure “intelligence.'’ 
They examined the five most comprehensive current test series, those 
of Cattell, Miinsterberg, Jastrow, Kraepelin, and Gilbert, and found 
all pf them greatly overweighted with sensory tests and lacking in tests 
of complex processes. From an analysis of the available data, they 
concluded that individual differences are more marked in complex 
tasks and that the lattei arc therefore better suited to the study of such 
differences. Partly to remedy this deficiency in the current tests, Binet 
and Henri described ten types of tests which in their opinion would 
yield the largest and most significant individual differences. The series 
included tests of memory, mental imagery, imagination, attention, 
comprehension, suggestibility, aesthetic appreciation, moral feelings, 
muscular force and force of will, and motor ability and visual discrim- 
ination. The entire scries, according to the authors, would require 
only from one to one and one-half hours. 

In 1S97, Ebbinghaus (12) proposed a theory to the effect that 
intelligence is the ability to combine or integrate the items of experi- 
ence, and offered the sente i ^e completion test as a technique for 
measuring this ability. In this test, the subject is presented with sen- 
tences in which certain of the words missing and he is required to 
fill in the proper words, fn experiments on German school children, 
Ebbinghaus had found this test more effective than simpler tests of 
calculation and memory. The completion test showed the most regular 
increase in score with age and it was also the only one of the tests 
employed which differentiated clearly among those pupils within each 
grade whose scholastic standing was good, average, or poor. Binet's 
contention for the superiority of the more complex tests in dift'erential 
psychology was thus corroborated. 

Two American studies of this period lent further support to Binet's 
statements. One of these studies (39) was conducted by Sharp, a 
student of Titchener, and was designed as a specific investigation of 
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the conclusions of Binet and Henri. A set of tests, modeled largely on 
those of Binet and lienri, was administered to seven advanced psy- 
chology students. The experiment was very intensive and included the 
repetition of similar tests on different days to determine the consist- 
ency of the processes tested. In general, although the need for further 
controls and refinements was suggested, the tests proved satisfactory 
and yielded sizable individual differences despite the homogeneous 
and select nature of the group. Sharp concluded: “We concur with 
Mm. Binet and Henri in believing that individual psychical differences 
should be sought for in the complex rather than in the elementary 
processes of mind, and that the test method is the most workable one 
that has yet been proposed for investigating these processes” (39, 
p. 390). A few years later, Wissler (49) published the results of his 
correlation analysis of the data collected in Cattell’s laboratory. The 
correlations showed “little more than a chance relation” among the 
tests, and also a negligible correspondence with academic grades. Thus 
the inadequacy of the simple tests originally employed was further 
demonstrated. 

Against this background of theory and data appeared the first intelli- 
gence scale, fn 1904 the French Minister of Public Instruction ^ap- 
pointed a committee to investigate the causes of retardation among 
public school children. Binet was one of the members of this com- 
mittee. As a direct outgrowth of hig work in this connection, Binet 
published, in collaboration with Simon, the 1905 scale for measuring 
intelligence. This scale consisted of 30 problems arranged in a rough 
order of difficulty. In 1908 appeared Binet’s first revision of the scale, 
in which the tests were grouped into age levels and the concept of 
“mental age” was introduced. The scale was again revised in 1911, 
the year of Binct’s untimely death. 

The Binet tests have been translated into most of the principal lan- 
guages and their use has spread over every continent. In America, 
several different revisions have appeared, of which the most widely 
known is undoubtedly the Stanford-Binet, prepared by Terman and 
his associates at Stanford University, The intelligence quotient (IQ), 
found by dividing the child’s mental age by his chronological age, was 

® The child’s score on an age scale is expressed as a mental age (MA). If, for 
example, he passes successfully all of the tests assigned to the 10-year level, he has 
a menial age of 10, regardless of what his chronological age may be. 
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jBrst employed in 1916 in the Stanford-Binet, although the use of such 
a ratio had been previously discussed by Stern and others. The Stan- 
ford-Binet itself has undergone repeated revision, its latest revision 
(often referred to as the Terman-Merrill Scale) having appeared in 
1937. This scale is available in two equivalent forms, L and M, which 
can be employed interchangeably. lts*age range extends from two years 
to the adult mental level of 15. There arc in addition three “Superior 
Adult” levels of increasing difficulty (cf. 46). 

Of special interest is the 1922 Kuhlmann-Binct revision which ex- 
tended the scale downward to the three months age level.'^ The con- 
struction of scales for measuring the intelligence of very young chil- 
dren represents one of the most recent developments in psychological 
testing.'* Such tests have now become differentiated into infant tests, 
covering the period from birth to about 1 Vi years, and preschool tests, 
designed primarily for children between the ages of Wi and 5. The 
former consist essentially of scries of developmental norms which can 
be applied to an evaluation of the child’s everyday behavior in such 
activities as crawling, walking, sitting up, standing, picking up and 
manipulating objects, recognizing colors and shapes, and acquiring the 
use of language. Among the most accurately established and extensive 
norms are those prepared by Gesell and his co-workers at Yale Uni- 
versity (14, 15), where hundreds of infants have been periodically 
examined in practically every type of behavior under carefully con- 
trolled conditions. Preschool tests present the child with simple, stand- 
ardized tasks, many of which involve motor coordination, the develop- 
ment of perceptual respontcs, and the understanding of spoken 
language. Among the best-known tests in current use for the preschool 
level are the Merrill-Palmer Scale (44) and the Minnesota Preschool 
Tests (19). 

A relatively early development in the history of intelligence testing 
was the construction of performance scales. It was soon realized that 
the Binct type of test, depending so largely upon language, is not 

^The 1939 revision of the Kuhlmann Tests of Menial Development represents 
an extensive restandardi/ation of items from various sources, particularly the Kuhl- 
manii-Binet and the Cicsell normative scales. 

^ It is interesting to note, however, that as early as 1887 a series of developmental 
standards and simple tests for judging the mental level of infants during the hrst 
three years was worked out by an American physician, Dr. S. E. Chaille. The con- 
cept of mental age seems to have been implicit in his treatment of the data, although 
this term was not employed (cf. 18). 
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suitable for testing illiterates, the foreign-speaking, the deaf, or those 
who have speech disabilities. Performance tests were designed to meet 
this need, as well as to supplement the Binet type of test for a better- 
rounded picture of the individual’s abilities. Among the earliest stand- 
ardized scries of performance tests was the Pintner-Paterson Scale 
(36), appearing in 1917. It consisted of 15 tests which could be 
administered without the use of cither oral or written language on the 
part of either examiner or subject. Blocks, pictures of simple objects 
or scenes, and geometric forms arc the principal materials of such 
tests. A more recently developed test is the Arthur Performance Scale, 
which originally consisted of a rcstandardization of ten of the Pintner- 
Paterson tests and of certain other tests taken from previously avail- 
able series. The Arthur Scale has itself undergone considerable revision 
and modification during the past twenty years, the Revised Form II 
having been published in 1 947 ( 1) . 

Mention may also be made of the recently developed Wechsler- 
Bcllevue Scale (48), consisting of both verbal and performance tests 
and designed especially for the testing of adults. In content the test 
represents a combination of the Binet type and the performance type 
of item. Norms arc given for ages 10 to 59. A special feature mf this 
test is the suggested qualitative analysis of specific response patterns 
in terms of traditional clinical syndromes, 

GROUP TFISIING 

All the intelligence tests discussed in the preceding section arc ‘'indi- 
vidual tests'” in the sense that only one subject can be tested at a time. 
Furthermore, owing to the nature of these tests, a highly trained 
examiner is usually required to administer them. Testing on a large 
scale could not be conducted under these conditions. Data on such 
problems as sex and race dilfercnccs, for example, which require the 
investigation of large samples, would be very slow in accumulating. 

The advent of the group intelligence scale was probably the chief 
factor in the widespread popularization of mental testing. Fhe group 
test is designed with a view to its general use. It is not only adapted 
to the simultaneous testing of large groups, but it is also relatively 
easy to administer and score. I'hc impetus for the development of 
group tests was furnished in 1917 by the pressing need of testing over 
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one and onc-half million men in the United States Army during World 
War I. A quick, rough classification in respect to intelligence was 
necessary for many purposes. Discharge because of serious mental 
defect, assignment to labor battalions requiring only low-grade work, 
admittance to officers training camps, and a number of similar prob- 
lems required a knowledge of the intellectual level of the soldier. 

Accordingly, a committee was appointed by the American Psycho- 
logical Association to devise a test suited to this purpose. The com- 
mittee consisted of five psychologists who were specialists in mental 
testing, and was under the direction of Robert M. Yerkes. All the 
available material on mental tests was examined for its suitability to 
the needs of the army testing program. An important source of such 
material was an unpublished group scale previously developed by 
Otis, wdiich he made available to the government. The final outcome 
of the research of the army psychologists was the Army Alpha and 
the Army Beta. The former was the more widely used of the two; 
the latter is a non-language scale, and was designed for testing illiter- 
ates and foreigners unfamiliar with English. 

After the close of World War 1, new intelligence tests were con- 
structed at a rapid rate. wSoon special tests were available for elemen- 
tary school children as well as kindergarten anct preschool levels, high 
school and college students, and unselected adults. Mental testing 
attained undreamed-of proportions. School teachers were now con- 
sidered to be qualified to administer the newly simplified tests. Large- 
scale sciK>ol surveys were initiated; college freshmen were tested as 
part of the routine of admi , ion, the general public became intelli- 
gence-test-conscious. and the iQ became a byword in everyday 
conversation. 

'I his sudden popularization and pi blicity proved to be a mixed 
blessing in the development of a measuring instrument which was still 
in its infancy. Despile the fact that the existing intelligence scales were 
still very crude, they were too often accepted as a finished product and 
an infallible guide. Analysis of results and eviiliuition of techniques 
were subordinated to the more alluring occupation of classifying 
people. Occasionally psychologists themselves were guilty of overhasty 
generalization. Data on the vari(ms problems of differentia! psychology 
were being amassed in a rush. Sweeping conclusions were drawn — and 
quoted. 
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THE MEASUREMENT OF SPECIAL APTITUDES 

The period between the two world wars witnessed many technical 
advances in psychological testing. The premature popularization of 
intelligence tests following World War 1 led to an inevitable rcactidn 
of skepticism among many laymejn, as exaggerated initial expectations 
remained unfulfilled. Jn the meantime, psychologists were taking stock 
of this new tool, and an intensive phase of “testing the tests” was 
ushered in. 

One of the principal results of such a critical study uf psychological 
tests was a shift in emphasis from the exclusive use of intelligence tests 
to the measurement of special aptitudes. This shift is especially appar- 
ent in the testing of older adolescents and adults, in whom separate 
abilities arc more clearly dillerentiated.^' Among the manifestations of 
this growing concern of psychologists with special aptitudes is the 
tendency to report sub-test scores on the various parts of intelligence 
tests. For example, the American Council Psychological Examina- 
tion,'^ administered annuall> since 1924 to the entering freshmen in 
many colleges throughout the country, is now scored so as to yield 
separate scores in the linguistic and quantitative parts. Prior to 1939 
only a single score on the total test was regularly reported for each 
student. 

Further evidence of the increasing recognition of the special apti- 
tudes which enter into “general intelligence” is furnished by the 
dcsnipllvc labels now attached to many tests o[ the type formerly 
known as “intelligence tests.” During the past two decades the desig- 
nation “intelligence test'’ has been commonly superseded either by 
terms which describe more precisely what the test actually covers or 
by terms which connote preliminary classification only. An example 
of the former is the term “scholastic aptitude,” now u^ed to describe 
many “intelligence tests” which were found to measure principally 
those aspects of intelligence demanded by school work. An example 
of the latter practice is furnished by the Army (General Classification 
Test (commonly referred to as the AGCT), which w^as developed in 
World War IT to serve the same general functions as the Army Alpha 
of World War 1 (cf. 6, Ch. 11). This test provided a rough, prelim- 

^’This increasing diffeienllation ot abilities with age will be discussed more fully 
in Chaptei 14 

The hill title of this test is Xmciican roimcil on Education Psychological Exam- 
ination for Uollc^^c rieshmen, but it ts commonly designated by the shoiter title. 
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inary means of classifying the recruits into five Army Grades accord- 
ing to their general ability to learn the various duties required in 
military life/*^ It was prepared in four equivalent forms, each requiring 
about one hour, including preliminary instructions, a forc-excrcise, 
ai!d the test proper given with a 40-minute time limit. This test con- 
sisted of verbal, numerical, and spatial items, arranged in order of 
difiiculty, and was given to every inductee who could read English. 
A later revision (AGCT-3), requiring about two hours, yielded sepa- 
rate scores in (a) verbal ability, (b) spatial comprehension, (c) arith- 
metic computation, and (d) arithmetic reasoning. This form, there- 
fore, also illustrates the diiTerentiation of total scores into sub-test 
scores discussed above. 

I’he clearest indication of the emphasis upon special aptitudes is to 
be found in the large number of special aptitude tests which have been 
developed in recent years. Such tests arc now regularly employed in 
individual guidance as well as in personnel selection. Although a gen- 
eral intelligence test is given as a preliminary instrument of classifica- 
tion for most jobs, such a measure is nearly always supplemented with 
more intensive testing in relevant areas. Many of these tests are 
custom-made for the specific job and arc tested locally through a 
direct follow-up of a typical group of new employees. The Army and 
Navy also made much more extensive use of special ability tests in 
World War II than in World War I, when such tests were virtually 
non-existent (cf. 6, Ch. 11). Special “batteries,” or combinations of 
tests, were constructed and assembled for pilots, bombardiers, range 
finders, radio operators, and scores of other specialized occupations of 
modern warfare. Tests of mechanical aptitude, clerical aptitude, motor 
dexterity, speed of reaction, visual and auefitory acuity under various 
conditions, perception of distance or depth, and code learning are 
among the many special areas covered in these batteries. 

It may also be of some interest to note that in a “poll of experts” 
(26), conducted in 1944, a representative group of psychologists in 
the testing field were found to be overwhelmingly in favor of con- 
tinued development in the direction of aptitude testing. Of the 79 psy- 
chologists who replied, 55 expressed the opinion that “most will be 
accomplished if psychologists concentrate on measuring separate 


A similar test was pieparcd by the Navy. Known as the Navy General Classifi- 
cation Tost, it consisted of 100 items similar to those in the original Aimy Alpha 
but expressed in naval terms and related to naval situations. 
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intellectual factors.” Only 5 believed that the further development of 
testing should concentrate primarily on the measurement of general 
intelligence; 7 put the emphasis equally on general and special apti- 
tude tests, and the remaining 12 gave no answer or an answer which 
could not be clearly classified into any of these categories.-^ It should 
not be concluded, of course, that this group of psychologists were 
dissatisfied with the current tests of intelligence. On the contrary, to 
the question, “In your judgment, how well do intelligence tests meet 
the practical needs for classifying people as to general mental ability in 
the army, in schools, and in industry?” over three-fourths of the group 
checked the reply, “Rather well, much better than is done without 
tests.” In the comments following this question, however, it was again 
apparent that the intelligence tests were regarded as instruments of 
preliminary or approximate classification, which could profitably be 
supplemented b} the measurement of special aptitudes. 

PFRSONAMTY 7 f.STS 

The extension of testing techniques from sensori-motor and “intcllec- 
luaf' functions to emotional and social characteristics is also a rela- 
tively recent development. An antecedent of current personality test- 
ing may be found in Kraepclin’s first use of the free association test 
on pathological cases and on persons who had been experimentally 
subjected to various influences such as fatigue, hunger, and drugs. 
Kraepelin (27) reported ihat all these agents increased the number of 
superticial associations. In 1S94, Sommer (40) suggested that mental 
disorders could be differentiated by means of the free association test, 
rhe use of (his test for a variety of purposes has persisted to the 
present day . 

The most familiar personality tests, however, arc those employing 
standardized qucstii>n}iai}C i>r raiifio scale methods. These methods 
were originally dexcloped by Galton, Pearson, and Caltell for other 
purposes. The first systematic application of such techniques to per- 
sonality testing is t(^ be found in the Woodworth Personal Data Sheet 
(cf. 45, Ch. 5), an inventory constructed during World War I to 
detect neuroticism among soldiers. Although the armistice was signed 

'* One also wondcis v\'hcllicf then' is any sifinificance in tlic fad that those icsponcl- 
enls who emphasized the development of jienciul intcllis:enee tests weic considerably 
oldci than those v.ho jjave picccdcncc to the testing of special aptitudes-' 
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before the final form of this questionnaire could be widely applied, it 
was subsequently used in army hospitals and in civilian testing. Sev- 
eral revision^ and adaptations of the Woodworth Ouestionnairc have 
appeared, including forms especially suited for children, and for col- 
ltij;e students. Tests of the same type have also been developed for 
other social and emotional charactegstics, such as introversion-extro- 
version and ascendance-submission. The adaptation of this technique 
to the measurement of interests and attitudes represents a still more 
recent ramification of personality testing. 

In certain areas of personality, pcrjornunice tests have been devised, 
the best known probably being the Hartshorne and May tests (21, 
??, 23 ) for measuring character traits in school children. T'hcsc com- 
prise an extensive series ol tests covering such behavior as cheating, 
lying, stealing, cooperation, persistence, and inhibition. All these tests 
are administered in everyday-life situations, including classwork, as- 
signed ‘'homework,” atlilctics, or party games. I'he children are not 
aware that they arc being tested or that their behavior can be detected 
or identified with them. Applications of these techniques to adults 
have been made from time to time in te^«ts of such behavior charac- 
teristics as honcslv, suggestibility, and persistence. 

A type of personality test which has been gaining prominence in 
recent years comprises those tests which are commonly classified 
under the general heading of '^projeclivc fecJiniqnes ” In all such tests, 
the subject is given a task which permits of an almost unlimited 
variety of solutions. The assumption underlying these tests is that the 
subject will ‘'projccl” into I. s p»,rformance his characleristic thoughts, 
worries, fears, attitudes, and other emotional responses. Several differ- 
ent tasks have been used for this purpose, including drawing, the 
arrangement of small toy objects tc form a scene. cxU'mporancous 
dramatic play, the ranking ol photographs in order of preference, and 
the interpretation of inkblots or of ]Mcturcs. 

The most cxlensnelv publicized of these projective techniques is 
undoubtedly the Rorschach Inkblot Test, in this test, ten cards arc 
presented, each containing an irrcgiilr.rly shaped but symmetrical “ink- 
blot.” Five of the inkblots arc in black and gray, and the remaining 
five in several colors. The subject examines each blot and reports all 
the associations and meanings suggested to him by the blot. His re- 
sponses aie then interpreted through a detailed scoring procedure 
which takes into account such features as responding to the whole blot 
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or to separate details, color responses, associations involving move- 
ment, and originality of associations, as well as the specific objects 
perceived. Another type of projective technique which has come to 
the tore following World War II is the sentence completion test. Atten- 
tion was especially focused upon this technique as a result of its dse 
in the O.S.S. Assessment Program. Such projective techniques are 
largely in an experimental stage: their procedure and scoring are not 
yet fully standardized, nor is their diagnostic significance conclusively 
established. The entire field of personality testing is at present in a 
formative stage and its technical development is far behind that at- 
tained in aptitude testing. 

CURRENT TRENDS TN DIFEERENTIAL PSYCHOLOGY 

Any summary of current trends in differential psychology must begin 
with a recognition of the rapid and uninterrupted growth in the sheer 
number of psychologic cil tests, \ glance at any of the bibliographies 
of mental tests which have appeared during the past decade fields 
ample evidence of such expansion. In the 1939 bibliography of mental 
tests and rating scales prepared by Hildreth (24), for examplc^4279 
titles are included; six pages are needed to cover the bibliography of 
bibliographies on tests. The 1940 Mental Measurements Yearbook 
(8), covering primarily paper-and-pcncil tests published in English 
during approximately seven years, contains over 1500 titles. The third 
edition of this Yearbook, published in 1949, lists more than 600 addi- 
tional tests which appeared between 1940 and 1947 (8 ). Such growth 
in tests has, furthermore, been multi-dimensional, testing being applied 
to more and moic diliercnt aspects of behavior and employing increas- 
ingly varied techniques. 

A second trend, especially apparent during the past decade, is the 
development of methodological lefinemenls in both the construction 
and the application of tests. Not only are more rigorous techniques 
being devised for the selection and evaluation of test content and the 
establishment of norms, but tiiere is evidence of a growing concern 
with the experimental design in which tests are used. Even the pro- 
tracted controversies which have appeared in the recent literature on 
such issues as the effects of schooling (cf. Ch. 8) have helped to 
sharpen criticism and to focus attention on rigid standards of experi- 
mental control. The increasing practice of constructing tests to suit 
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the special needs of particular situations, as well as the practice of 
checking the diagnostic value of existing tests in the specific setting 
in which they are to be used, likewise reflect methodological progress. 
Mention may also be made of the trend toward more detailed analysis 
of* test performance with reference to sub-tests and even individual 
items, as well as the study of the possible diagnostic significance of 
patterns of responses within a test. 

A closely related trend, but one of sufficient importance to merit 
separate consideration, is the increasing frequency of longitudinal 
studies. The past decade has witnessed the completion of several major 
investigations involving ten- to twenty-year follow-ups of the same 
groups of individuals. A number of “growth studies” employing re- 
peated psychological as well as physical measurements are now in 
progress, some beginning with individuals at birth (cf. Ch. 9). 
Follow-up studies of intellectually gifted children (cf. Ch. 17), as well 
as studies on the prolonged effects of special training (cf. Ch. 8) illus- 
trate other applications of such an approach. 

Differential psychology is also gradually emerging from the initial 
stages of description to embark upon an active search for underlying 
explanatory principles. Hitherto unrelated facts from various branches 
of the field arc slowly being coordinated. More investigations are being 
designed with a view to studying the conditions under which individual 
differences develop or become modified. 

Finally, increasing interest is being shown in the fundamental nature 
of psychological traits (cf. Ch. 15). Theoretical discussion of the con- 
cept of “trait” has flourishca. Analyses of the specific components of 
“intelligence” and “personality” have multiplied. The combination of 
this growing theoretical sophistication with the development of method- 
ological refinements should forecast a productive future for the differ- 
ential approach to behavior phenomena. 
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PsYcnoroGiCAL TESTING AND STATISTICS are the principal tools of 
diflcrential psychology. As noted in the preceding chapter, it was the 
realization of the need for such tools in the study of individual differ- 
ences which led Gallon to the development of the first simple tests 
and to the establishment of the statistical laboratory which still bears 
his name. Subsequent developments in differential psychology closely 
paralleled the growth of the mental testing movement. Thus it should 
be apparent that an adequate understanding of the findings of differ- 
ential psychology presupposes familiarity with at least the basic con- 
cepts of psycliological testing and statistical method. 

It is obviously beyond the scope of the present book to cover cither 
of these fields. The reader is referred to any of the standard texts of 
psychological testing (cf., e.g., 7, 10) and of elementary psychological 
statistics (ef., e.g., 6, 9) for this purpose. Throughout the chapters 
which follow, how'cver, some consideration will be given to the essen- 
tial implications of the statistical techniques employed in the specific 
problems under discussion. Similarly, the present chapter will review 
the fundamental characteristics of psychological tests, without any 
attempt to survey spccilic available tests. Special attention will be 
given to the common prcKcdurcs of test construction, since a knowl- 
edge of such procedures is essential to the proper interpretation of test 
scores. 


BEHAVIOR SAMPLE 

Every psychological test is essentially an objective and standardized 
measure of a sample of the individual’s behavior. Any one test obvi- 
ously covers only a small sample of the type of behavior which it is 
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designed to explore. In this regard, the psychologist’s procedure is 
similar to that of, for example, a chemist who tests a shipment of iron 
or milk by analyzing one or more samples of it; from the directly 
measured characteristics of such samples the chemist then makes 
approximate deductions regarding the properties of the entire ship- 
ment. Similarly, when the psychologist wishes to measure the indi- 
vidual’s vocabulary, arithmetic ability, or hand coordination, he 
observe^ the individual’s performance with a limited number of words, 
arithmetic problems, or hand movements, carefully chosen to be lep- 
resentative of the total behavior segment he wishes to test. 

Obviously both the number and nature of the items chosen for any 
spccilic test will determine the adequacy of coverage of that test. 1 hus 
an arithmetic test consisting of only live items would scarcely be 
expected to constitute an adecjuate sample of the subject’s ‘‘arithmetic 
behavior.” Nor ccnjld a satisfactory measure of such behavior be ob- 
tained from a test composed exclusively of multiplication problems. 
The diagnostic or predictive significance of any test depends upon the 
degree to which it serves as an indicator of a wider range of similar 
behavior. It is only in this sense that the psychologist can test behavior 
over and above that which he is directly measuiing. For examfile, it 
might prove possible to predict an individual's ability to learn French 
from his performance in a one-hour test in the learning of an artificial 
language. If such weie the case, wc might say that the individual’s 
“capacit)” to leain French had been tested beloie he had even begun 
to studv French. It is only through such sampling of relevant behavior, 
however, that a psychological test can predict “capacities.” Contiary 
to certain [Hipular notions, mental tests have no special pow'ers for 
penciratiiie bcMnicI obseivahlc behavior into a dark realm of hidden 
potentialities iind latent aptitudes. 

SI VNOVROI/MION 

If the results of a psychological test arc to have any value in predict- 
ing or diagnosing behavior, it is essential that the testing procedure be 
thoroughly standardized. Ihe standaidization of a test consists in the 
establishment of unilonn couditkms for administering the test to all 
individuals, as well as a uniform method for evaluating respoi ses. It 
will be noted that this is simply a special application of the general 
requirement of controlled conditions in all scientific observation. The 
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one variable in a test situation is the individual who is being tested. 
Only if all other conditions are kept rigidly constant can the differ- 
ences be attributed to the individual himself, who is the sole variable. 

rhe layman generally has little realization of the degree to which 
UTfiformity of conditions must be maintained in a testing situation. It is 
largely for this reason that psychologists are skeptical of scores ob- 
tained by untrained examiners. It is not enough that the prescribed 
time limits be observed to the second and that the exact wording of 
instructions furnished in the test manual be h>llowed. Such factors as 
the rate at w'hich the directions arc read to the subjects, the vocal 
inflections, pauses, and facial expressions which accompany them, 
ar-vl the exact placement of demonstration materials also atlcct the 
subject’s performance. Altering tmy of these conditions may materially 
increase or decrease the difliculty of a test item for the particular 
person being tested. Such disturbances as undue fatigue or discom- 
fort of the subject and distractions from persons w'tilking about or 
from noises should obviously be avoided. Ihe examiner must also 
make some effort to obtain the proper '‘rapport" with the sub- 
jects. This means that before beginning the test proper, he must arouse 
the subjects’ interest and cooperation, make certain that he has their 
attention, and in every way insure that each subject will perform to 
the best of his ability. One of the advantages of individual over group 
tests is that with the former it is possible to establish rapport more 
lully and to maintain the sut-jeefs interest throughout the test. 

NORMS 

The Concept of Norms. The proce s of standardizing any psycho- 
logical test includes not only the esla‘ -lishment of uniforn:. procedure 
of test administration, but also the objective determination of norms. 
Without such norms it is impossible to interpret or evaluate the sub- 
ject's performance on the test. By checking the subject’s response on 
each item against a scoring key, the examiner determines the nue 
score. This raw score may be the total number of correct items, or the 
time required to complete a specified task, or some other objective 
measure of response appropriate to the content of the particular test. 
Such a score, however, has little or no meaning in itself. Psychological 
tests have no arbitrary or predetermined standards of 
“failing’', the individual’s performance can be evaluated only in ref- 
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erencc to the performance of other comparable individuals who have 
taken the same test. If, for example, an individual correctly identifies 
75% of the words in a vocabulary test, such performance may be ex- 
cellent, or just fair, or quite inferior. The question cannot be answered 
without reference to the norms for the particular test. 

As its name imp^ cs, a norm ^represents the “normal” or average 
performance on a specific test. A test designed for 8-year-olds, for 
example, must first be administered to a large, representative group 
of 8-year-olds, in order to determine what is the average 8-year-old 
performance. If on this test the average 8-ycar-old completes 6 out 
of 15 problems correctly, then a raw score of 6 becomes the 8-year 
norm on this test. Once established, such a norm can be used in the 
future in evaluating the performance of any 8-year-old child who 
takes this test. In actual practice, norms provide not only the average 
score but also the relative ircqiienc> of vaiying degrees of deviation 
above and below the average, thereby permitting a more precise eval- 
uation of scores throughout the entire range. 

A word may be added regarding the interpretation of scores on 
personalitv tests. In such tests, theie arc generally no “right” or 
“wrong” answers. Consequently in the scoring key the responds are 
simply classified with relerencc to ccrUiin categories — such as “as- 
cendant” or “submissive,” lor example — with no implication that cither 
category is right or wrong. The c(,mcept of norm, however, is defined 
in essentially the same terms tor personality tests as it is for intelli- 
gence or special aptitude tests. A personality test norm is an objec- 
tively determined average; it is not an ideal or perfect score, nor is it 
predetermined. For example, the norm on an emotional adjustment 
questionnaire might be icpre‘>ented by a raw score of 12 neurotic 
symptoms On tests of such characteristics as inlnwcrsion-extroversion 
and ascendance-submission, the norm generally tails at a point midw^ay 
between the two extiemes. As in the case ot all types of psychological 
tests, such norms arc iound by administering the test to a large sam- 
pling of people representative of the population on whom the test will 
ultimately be used. 

Common Types of Norms. Norms have been vai iously expressed. 
Among the best known is the mental age norm, first developed by 
Binet. Age scales, such as the Binet and its derivatives, group the 
separate tests or items into age levels in terms of the performance of 
the subjects in the standardization group. Thus a test passed by the 
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average 7-year-old ^ is allocated to the 7-year level; one passed by the 
average 11 -year-old, to the 11 -year level. When the final standardized 
scale is administered to an individual subject, his score is expressed in 
terms of the highest year level he is able to reach. In actual practice, 
an individual rarely passes all tests at and below a given year level, 
while failing all those above it. A certain amount of “scatter” of per- 
formance over adjacent year levels is the usual pattern. In such a case, 
the individual is given additional partial credits (expressed in months 
or fractional parts of a year) in the determination of his final men- 
tal age. 

In order to furnish a score which will be comparable at different 
ages, the mental age scores arc generally transmuted into a relative 
measure or quotient, the most familiar of which is the Intelligence 
Quotient (IQ). This is simply the ratio of mental age to chronologicak 

age - ^IQ = 10-ycar-old child obtains an MA of 10, 

his IQ will be exaetly 100. When the MA is lower than the CA, the 
10 will be below 100. IQ’s above 100 signify that the individual’s MA 
is higher than his CA, or that his test performance equals that of chil- 
dren older than himself. 

An IQ of 80 or one of 120 represents the same degree of retarda- 
tion or of acceleration, respectively, at all ages. This is not true of the 
mental age unit. Thus a retardation of one year in MA at age 4 is a 
more serious degree of backwardness than a retardation of one year 
at age 12. This follows from the fact that intellectual development, 
as measured by such tests, i. moie rapid during early life and exhibits 
a gradual slowing down with increasing age. There is a more marked 
difference between the tested abilities • -f the average 3- and 4-year-olds 
than there is between J 1- and 12-yci r-olds." Actually, the child who 
is one year retarded at the age of 4 will in general be three years re- 
tarded when retested at age 12. Under these circumstances, the IQ, 

^ In Ihc actual construction of an aue scale, the per cd'J of subjects of each age who 
pass tests assigned to that ycai knci vanes with age. This pM cent must be greater at the 
lower ages than at the upper ages in order to vrehi the increasing variability of MA re- 
quired for a constant IQ. In the Slanford-Binct, u r yainple, the per cent of at-age passes 
drops fiom around 11 at age 2 to slightly below 50 at the .'werage adult level. Even 
lower per cents of passes are used for the items at the superior adult levels in order to 
provide adequate ceiling. 

" It is customary to multiply the resulting quotient by 100 in order to avoid 
decimals. 

Because of the inequality of successive mental age units, they cannot properly 
be averaged. This is especially true of mental ages which are far apart. 
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being a relative measure, will remain constant. In the present example, 
the child's IQ at age 4 will be 3/ 4, or 75; at age 12, it will be 9/ 12, 
or 75. Retest studies have shown that mental age deviations from the 
norm do in fact increase with age and that the IQ consequently tends 
to remain approximately constant, provided that the individual is 
not subjected to drastic environmental changes or other unusual 
conditions.'^ 

One of the minor irritations to which psychologists arc repeatedly 
exposed is to hear intelligence tests gcnerically described as “IQ tests.’’ 
Apart from the lact that the 10 is a score and not a type of test, it 
should be dearly recogni/ed that all intelligence test scores arc by no 
means IQ’s. In fact, outside of a clinical situation, the ma)ontv ot intel- 
ligence test scores are likely to be cxpiessed in some other form 
Gtoup tests, for example, generally >ield scores which do not lend 
themselves U'j expression in terms ot IQ.' Similarly, tests designed ior 
adults do not as a rule employ the IQ. 

A briel consideration of the method for obtaining tlie IQ oi an 
adull will demonstrate why the IQ concept is ol little value m adult 
testing. In the first place, on an age scale such as the Stanlord-Binet. 
the average individiial's perlormance does not impro\e signilTcanlly 
beyond age 1 5. I his means that the average 2 l->e<ir-old or 3i)-year-old 
will do no better on the Stanford-Binct than Ihe average 15-\ ear-old. 
In Older to compute an adult IQ on such a test, therefore, a divisor of 
15 IS used in place ot the individuars actual chronological age. Such a 
procedure will )ield an IQ of 100 for an\ adult whose MA is 15, i e., 
whose test pcilormanee cqiuds that ol the average adult. It will be 
recalled, however, that the latest revision ol the Stanlord-Binet has 
three Superior Adult levels ol increasing diOiculty. II an individual 
were to pass all tests on the scale, through Superior Adult Level III, 
his MA would be 22 years and 10 months. Such a score is obviously 
a menial age on!> in an l xtrapolated sense and cannot be interpreted 
in terms ol the oiiginal concept ol mental iige. A mental age ol 8 rep- 
resents the ability ol the average 8-year-old, but a mental age of 22 
docs not represent the ability ol the avciage 22-year-old — the average 

^The pioblcm of ihc L()iistain.> ol Uil IQ will be covcicd moie fully in Chaj)tcis 
8 and 9 

-pcciijcally a niaior re for the isc of the IQ is tint ihe extent of 

individual diflcienets in M\ incc^sc systcniat callv wnh age. This requircinent is 
raiely if c\ci met by group It 
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22-year-old actually has a mental age of 15! In view of the above con- 
siderations, it is not surprising to find that the use of the IQ has been 
largely superseded in adult testing by one of the other types of scores 
to be discussed below. 

*A second type of norm is the percentile norm, based upon the per 
cent of individuals in the standardization group who fall below a given 
score. For example, if bSVr of the standardization subjects score below 
15 in an arithmetic reasoning test, then a raw score of 15 on this test 
corresponds to a percentile of 08. Anyone who completes 15 problems 
correctly, but no more, would receive a percentile score of 68. If w^c 
know that an individual has received a percentile score of 68 on any 
tc\l, we can cc>ncludc that his performance excels that of the lowest 
68 of the standardization group for that test. The 50th percentile on 
any test is obvit)Lhly the midpoint, or median score. A zero percentile 
w'OLild signify a score as low as. or lower than, any of those obtained 
in the standardization group; a lOOtli percentile, a score higher than 
any obtained in the standardization group. The former need not indi- 
cate the failure to complete any items; nor does the latter necessarily 
represent a perfect score. Percentile scores should not be confused 
with the familiar percentage scores. The former arc expressed in terms 
of people, the latter in terms of the number of itefus which the indi- 
vidual completes correctly. An ordinary percentage score is regarded 
as a raw score in psychological testing and is meaningless w'ithout 
reference to the norms. 

Percentile norms furnish a convenient means of determining roughly 
the individual s relative stanumg in one or more tests. They are prob- 
ably the most common type of norm employed with group tests. Cer- 
tain cautions should, howcNcr. be o..,crved in the use of percentile 
scores. Fhc chid point to bear in mind regarding such scores is that 
they are essentially ranks and arc therefore subject to whatever limi- 
tations apply to ranks Pcicenlilc scores are measures of relative posi- 
tion, not of amount. For this reason, we cannot assume that successive 
percentiles represent equal diU'erenccs in performance. In fact, w'e 
know that successive percentile scores .a.ar the mean, i.c., around the 
50lh percentile, corrcs[)ond to much smaller ability differences than 
do percentile scores at the* cAtremes of the range.^’ For example, the 

•’‘Tins (()II(,v\s I'loin \ht' noimal d.slnbulion ol peiformanco and will be discuss)ed 
fiinhci in ("hcipicr 3. 
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distance between the 90lh and 91st percentiles covers a much greater 
gap in performance than does that between the 50th and 51st per- 
centiles. Ii follows from this inequality of units in different parts of 
the percentile scale that percentile scores cannot properly be averaged. 
This is particularly true at the extremes of the distribution, where the 
discrepancies in units become conspicuous. 

A more precise and universally applicable technique for reporting 
test norms is in terms of standard scores/ Such scores use as their 
unit the standard deviation (SD or a) of the distribution of scores of 
the standardization group. The standard deviation is a measure of the 
extent of variability or individual differences within a group. It is 
found by subtracting each individiiars score from the group average, 
squaring each of these individual deviations, and then obtaining the 

square root of the average of these squares ^SD -= m . Once 

computed for the particular group, the SD furnishes a convenient unit 
for indicating how tar above or below the group average each individ- 
ual falls. Let us suppose that the average raw score of the standard- 
ization group in Test A is 60 and the SD is 5. An individual with a 
raw score of 70 on Test A would then be two standaid devrations 
above the average of this group (70 -- 60 - 10; 10 5 - 21. Such an 
individuaPs standard scofc is said to be 2 Similarly, a raw score of 
55 on 1 cst A would correspond to a standard score of —1, and a raw 
score of 58, to a standard score of —0.4. A raw^ score of 60 would cor- 
I'cspond to a standard score of 0, which alw^ays indicates the mean of 
the group. 

A ireqiient practice, designed to avoid the use of decimals and 
negative quantities, is to convert the standard scores into a more con- 
venient scale by adding an arbitrary constant and applying a constant 
multiplier to each score. A good illustration of this proceduie is fiit- 
nished by the scoring ol the Army Gencial riassification Test. In 
effect, the scores on the ACiCT were standard scores which had been 
multiplied by 20 and added to 100. To put it differently, the raw AGCT 
scores were converted into standard scores in a distribution with a 
mean of 100 and an SD of 20, rather than the usual mean of 0 and 
SD of 1 , which is implied in the simple conversion into standard scores. 


Also icfcrrcd to .is “sipni.i sloics*’ and “/-scores 
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Each of the five Army Grades into which the inductees were classi- 
fied corresponds to one standard deviation unit in the AGCT dis- 


tribulion, as shown below (cf. 2, 11). 


Army Grade 

Distance from the Mean 
in SD units 

AGCT Standard Scores 

1 

+ 1.5 and above 

130 and above 

11 

+ 0.5 to +l.< 

110 to 129 

Til 

- 0.5 to -f 0.5 

90 to 109 

IV 

-1.5 to -0.5 

70 to 89 

V 

Below 1 .5 

Below 70 


It will be noted that a score of 100 was arbitrarily selected in this 
conveited scale to correspond to the average performance of the 
standardization group.'" Every 20 points above or below 100 repre- 
sents one standard deviation above or below the group mean, respec- 
tively. Thus an individual with an AGCT standard score of 146 could 
he immediately recognized as being exactly 2.3 SD's above the group 
mean (146 — 100 46; 46^20—2.3). Such converted standard 
scores combine the advantages of convenience with wide applicability 
and precise interpretation. These reasons explain why current tests are 
making increasing use of standard score norms.’’ 

The Specificity of Norms. A requirement which is all too often 
overlooked in the application of norms is the comparability of the 
standardization group to the subjects on whom the test is to be used. 
It is not suilicient to know that norms were found on a large sampling. 
The nature of this sampling must also be taken into account in deter- 
mining the uses to which the test may be put and in interpreting the 
scores. Very few tests arc standardized on the general population. The 
Stanford-Binct and the AGCT probably come closest to these con- 
ditions. The former was standardized on a sampling which covered 
the school population in America quite adequately, although the selec- 
tion of subjects at the lower and upper age ranges of the scale was 
not so representative; the latter furnishes norms on an unusually 

Unfortunately, because ol this supeificial similarity to the IQ scale, many laymen 
mistook the A(K'T score,' (or IQ's, thereby conliibuling to the popular misuse of 
the latter teiin. 

‘‘For the statisticallv tuincd reader, slanda d scores O^-scores) should be differ 
cnliated from normah/ed standard scores (T-scoies). Altitoiigh the actual values ob- 
tained by the two methods do not usually differ very much, the assumptions under- 
lying their computation arc quite different. T-scores represent equal-unit scores in a 
normal distribution, while siandaid scores have the same form of distribution and the 
same inequalities of ditliculiy steps as the original raw scores. 
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extensive representation of American men of military age. Even these 
two samples, however, are obviously restricted. Age level is certainly 
one restricting factor in both, and other selective factors are un- 
doubtedly present in milder degrees. The large majority of tests arc 
considerably more restricted, their norms having been obtained ‘on 
much more narrowly defined populations than was the case in these 
two tests. Thus one test may be standardized on college freshmen, an- 
other on groups of applicants for three or four specific jobs, a third 
on children from the fourth to the eighth grade of elementary school. 

For many testing purposes, these more specific norms are of 
greater value. The test which undertakes to sample a more restricted 
population can, as a rule, do a more thorough job of sampling. At 
the same time, the comparison of an individual with a more clearly 
defined group to which he belongs permits a more significant inter- 
pretation of test scores. For example, it is usually of more practical 
importance to know how a college freshman'’s score compares with 
the average of freshmen in his own college than to know where the 
individual stands in relation to college freshmen in general. To com- 
pare such a score with the norm for the general population would be 
of still less value. Similarly, the personnel worker wants to knefw how 
a given applicant's test performance compares with the norms of 
applicants for the specific type of job to be filled, and preferably with 
norms obtained for this purpose ^in his owm company. 

It also follows that the scores from dilferent tests — which have 
been standardized on diflcrcnt samples — are not directly comparable. 
Even when the populations for which the tests were designed arc 
superficially the same, dilTcrcnccs in the specific samples employed 
may significantly alter the meaning of the norms. Such differences were 
vividly illustrated in an analysis carried out in the Harvard Cirowth 
Study (4). Out of the total group of school children \who were re- 
tested annually over a twelve-year period in this study, complete test 
records were available for 320 subjects. T hese subjects, all of wliom 
had taken nine common group intelligence scales as well as the 
Stanford-Binet, served as the basis for a comparative study of the 
interpretati(^n of an IQ derived from the dilferent tests. 

This analysis revealed considerable variation in the meaning of IQ's, 
not only from dilferent tests, but aiso from dilferent levels of the 
same test. Both of these fluctuations in IQ arc attributable in large 
part to differences in sampling in the standardization groups. The 
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group used in the standardization of one test (or one age level of a 
given test) may have been somewhat brighter — or duller — than that 
used for another test or age level. Despite the efforts made to obtain 
rejirescnlative samplings in the standardization of these tests, the 
groups actually used, in at least some of the tests, evidently fell short 
of this requirement. 

For precise testing purposes, in which differences of even a few 
JO points may be important, it is essential to allow for such variations 
in norms. This is particularly true in longitudinal studies, in which 
presumably equivalent tests (or levels of the same test) are ad- 
ministered at different times to the same subjects. In the Harvard 
Cjjowth Study, the equivalence of lO's on different tests was deter- 
mined by reference to their relative position or percentile within the 
group under investigation. For example, the median 10 of the 320 
children was 94 on one test, 102 on another, and 110 on a third. 
These three lO’s were then considered to be equivalent on the cor- 
responding tests. Similarly, the 80th percentile of the group was rep- 
resented by TON of 108, 118, and 124, respectively, on the same 
three tests. In this case, an JO of 108 on the first test would ‘"cor- 
respond” to one of 1 18 on the second and to one of 124 on the third 
test. 

The variations actually found in this study are remarkably large. 
The authors concluded that, “an IQ of 100, which is commonly 
interpreted as indicating average ability and a position near the center 
of an unselected group, represents, on tests given for the first time, 
positions varying from the J9th to the 65th percentile . . . from one 
in the lower quarter of the group, representing an ability which is 
supposed to approximate dullness, to one near the upper third of the 
distribution, indicating biightness of a promising nature” (4, p. 134). 
It is thus apparent that an 10 — or any other type of test score — 
should be accompanied by an indication of the test upon which it was 
obtained. vSuch a score cannot be properly interpreted without full 
knowledge of the nature of the group from winch the test norms were 
derived. 


TEST RELIABILITY 

The Meaning of Reliability. The concept of reliability is of funda- 
mental importance in differential psychology. In all the varied applica- 
tions of this concept, its common meaning is consistency. As applied 
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to psychological tests, reliability denotes the consistency of the sub- 
jects’ scores upon retesting. If, for example, a given individual’s IQ 
on a certain intelligence test is 135 on one day and drops to 86 upon 
retesting a few days later, the test obviously has very low reliability. 
Any one IQ found with this test would have little or no diagnostic 
value. Such retest fluctuations are known as the error of measurement 
of the test. Every test score will show some error of measurement, 
individuals rarely performing in identical manner on two occasions. 
Such marked changes as those in the above hypothetical example, 
however, would render a test useless for practical purposes. Changes 
of such magnitude might be the result of inadequate standardization 
of procedure, poor rapport, and other administrative conditions. Or 
they might indicate that the test is unduly susceptible to extraneous 
influences, such as weather conditions, or to minor emotional fluctua- 
tions of the subject, which would raise or lower the score on a par- 
ticular occasion. 

The ReUability Coefficient. The usual way of reporting the relia- 
bility of a test is by means of the reliability cocilicient.^^^ This is the 
coefficient of correlation between test and retest scores of the same 
group of subjects. The correlation coefficient tr) is a single numerical 
index of the degree of relationship or correspondence between any 
two sets of measures. This coefficient can vary numerically from 
+1-00, a perfect positive correlation, through 0, to —1.00, a perfect 
negative or inverse correlation. A -|- 1 .00 correlation means that the 
individual receiving the highest score in one set of measures also 
receives the highest score in the other set, the one who is second best 
in the first is second best in the second, etc., each person’s relative 
standing in the two measures being identical. A —1.00 correlation, 
on the other hand, indicates that the highest score in one measure is 
paired off with the lowest in the other, a corresponding perfect reversal 
occurring throughout the group. A zero correlation signifies no rela- 
tionship at all between the two sets of scores, or the sort of arrange- 
ment which would result if the two sets of scores were shuffled and 
paired off at random. Perfect positive or negative correlations are 
very rare in actual practice, most of the coefficients falling on inter- 
mediate values. 

The measure of correlation most commonly used in psychology is 


a discussion of some of the implications of different techniques for the 
measurement of test reliability, cf. Thorndike (13). 
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the Pearson Product-Moment Correlation Coefficient. If, for example, 
we wish to correlate the scores on two administrations of the same 
test, A and B, by this method, we would find each person’s deviation 
(or difference) from the average score on Test A and multiply it by 
th^ same person’s deviation from the average score on Test B. The 
average of these products for the entire group is the correlation co- 
elficient.^^ It is obvious.that if those individuals who are above the aver- 
age in Test A are all above the average in l^est B, and those below in 
A are also below in B, then the products of the deviations will all be 
positive and the correlation will be positive. On the other hand, if 
most of the individuals who are above the average in A (i.e., positive 
deviations) are below the average in B (i.e., negative deviations), then 
the products of these deviations will be negative and their average 
will be negative, thus yielding a negative correlation. 

The computation of reliability coefficients is one of the many uses 
to which the correlation coefficient is put in psychological testing. 
Most tests in current use have reliability coefficients in the .80’s or 
90’s, Reliabilities which fall appreciably short of these values do not 
meet the standards of consistency needed for most testing purposes. 

The Role of Behavior Fluctuations. If we regard test reliability 
as an index of the consistency of the test as a measuring instrument, 
then perfect reliability is not at all inconsistent with lluctuations of 
responses on different occasions (cf. 1). A discrepancy in score on 
successive retests may simply mean that the test is serving its function 
as an accurate and sensitive index of actual changes in the subject. 
To take an example from a ^ dferent field, one does not measure the 
reliability of a thermometer by comparing temperature readings on 
different days. The thermometer ma; be perfectly reliable and still 
give very different readings on succe!,sivc days. Such fluctuations in 
daily temperature readings would be of interest if one wished to deter- 
mine the reliability of an estimate of daily temperature in a given 
locality, from a single day’s reading. It would, in other words, indicate 
the consistency of the temperature, not of the ihermometer. Similarly, 

Since it js essential that the two deviation he multiplied be in the same units, 
the actual computation of i involves the transr ulation of all scores into standard 
scores. Several foimulas are available for facilitating the compulation of this coeffi- 
cient, all of them automatically performing this transmutation. The most familiar 
Sxy 

foimula is: ^ which ^xy is the sum of the products of the deviations 

discussed above, ov and Or are the standard deviations of the tvo sets of scores, and 
N is the lotal number of cases in the group. Seveial other formulas have been 
developed which permit further computational short-cuts. 
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the discrepancies in score on different occasions may simply show 
how variable the subjects arc in the functions tested. 

This problem is not especially serious in the measurement of apti- 
tudes, since the individual’s abilities in arithmetic, vocabulary, 
mechanical comprehension, motor coordination, and the like are hot 
likely to alter appreciably fiom day to day. Hence fluctuations in such 
tests can be attributed to the “unreliability” of the measuring instru- 
ment and steps can be taken to improve or replace the test. If the 
interval between successive retests is one of several years — or even 
several months in the case of young children — then, of course, the 
changes in score cannot be attributed solely to the tests. Similarly, if 
the subject has undergone drastic changes in environment or in 
physical condition during the interval between testings, sharp rises or 
drops in test score may occur which have no bearing upon the re- 
liability of the tcsi,^“ In the absence of any unusual circumstances and 
with intervals of only a few' weeks or days between retests, how'cvcr, 
it is probably safe to expect aptitude test scores to remain approxi- 
mately constant. I'o demand high retest correlations in such cases is 
therefore justilied. 

In the held of personality testing, on the other hand, it is rca^^Miablc 
to suppose that an individual’s attitudes, dominance, sell-conlidence, 
and the like may vary appreciably even iwer short intervals. The 
susceptibility of such characteristics to the individuars experiences 
immediately preceding the test cannot be ignored. This is not meant 
to imply that personality characteristics do show daily lluctuations, but 
only that we cannot legitimately assume their constancy, and thus 
cannot ascribe all sources of variation in score, ipso jacio^ to imper- 
fections of the test. It is apparent that the retest method of determin- 
ing test reliability does not lend itself very well to personality tests, 
since a low' retest correlation in such a test would be ambiguous. 

The Sampling of ("ontenl. When a test is of such a nature that the 
subjects can recall some of their responses from the first to the second 
testing, or when an appreciable practice effect occurs in the course of 
the testing, then two equivalent jornis of the test arc administered 
rather than repeating the identical test. Such equivalent forms are 
composed of different items chosen as samples of the same abilities. 
The correlation between such equivalent or parallel forms thus de- 
pends both upon the day-by-day consistency of the scores and upon 

‘“F'oi a discussion of relevant material, see Chapters 8 and 9. 
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the degree to which each form actually samples the entire behavior 
area which is being measured. For example, if a list of 20 carefully 
selected words is used to sample the individuars vocabulary knowl- 
edge, how far will the subject's score change when he is given another 
vocabulary list chosen in the same manner but containing 20 different 
words? Obviously ilie longer the list of items the higher, in general, 
will be the consistency of the subject's performance on the second, 
equivalent list. Other things being equal, a longer test can sample the 
behavior in question more thoroughly than a shorter test. It is, in 
fact, possible to predict by means of a statistical formula approxi- 
mately how much the reliability cocllicicnt of a test will rise when the 
Xcsl is lengthened by specified amounts. 

Test reliability is sometimes defined exclusively in terms of the 
adequacy with which the test samples the behavior under considera- 
tion. Such a definition is implied by the common practice of comput- 
ing reliability by the spLt-iialf icchulqiie. This type of reliability 
cijcfiicicnt, sometimes known as the coefficient of internal consistency, 
IS universally applicable to all types of tests and is undoubtedly the 
most widely used. Not only does it avoid the recall of items on 
retests, but it also rules out any general effects of practice, fatigue, 
or similar cumulative factors. The split-half reliability is likewise unin- 
fiucnced by the possibility of “true'’ daily lluctuations, as in the case 
of personality tests discussed in the previous section. The procedure 
consists essentially of correlating two sets of scores obtained during 
a single administratit)n of a single form of the test. Perhaps the most 
common torm of the split-liaii procedure is the “odd even technique,” 
in which each subject's score on the odd items is correlated with his 
score on the even items. In this way, li^ither score has any appreciable 
advantage or disadvantage in terms of adaptation, practice, fatigue, 
boredom, difficulty level of items, or any other condition which may 
vary progressively during the test period. Other ways of dividing the 
test ciiii, however, be employed. In a speed lest, for example, the 
performance during the first and last quarteis of the test period is 
usually combined to yield one of the I'xO half-scores, while the per- 
formance during the two middle quarters determines the other. 

It will be noted that the 'correlation of halves” thus obtained 
actually shows the reliability of vmly half the test. If, for example, the 
entire test consists of 100 items and the reliability coelficicnt is com- 


’ ^ Known as the Speaiman-Brow^n formula — cf., eg., Garrett (6), pp 387 390. 
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piited by correlating scores on two sets of 50 items each, the obtained 
reliability will obviously be lower than that of the entire test. For 
this reason, it is customary to estimate from the correlation of halves 
what would be the reliability coefficient of the whole test, i.e., of a 
test double in length. The formula discussed above for determining 
the effect of lengthening a test upon its reliability coefficient is used 
for this purpose; the estimated reliability is accordingly designated 
the “Spearman-Brown reliability coefficient.” 

It is apparent that the three principal methi>ds for determining the 
reliability of a test differ not only in the range of situations to which 
they are applicable, but also more fundamentally in the aspect of 
reliability which they measure. The retest method indicates the degree 
of day-by-day consistency of performance; the equivalent-form method 
combines day-by-day consistency with adequacy of behavior sampling; 
the split-half method is based only upon adequacy of behavior 
sampling. 

The Effect of Range. The reliability coefficient, in common with 
all correlation coefficients, is influenced by the range of scores within 
the group on which it is computed. In general, the reliability co- 
efficient of a test will be lower when found on a group of a single age 
level than when computed on a group of varying age. The latter, more 
heterogeneous age group will exhibit a wider range of scores on most 
tests than will a group homogeneous in age, and will consequently 
yield higher correlation coefficients. The age range of a group is one 
of the most frequent sources of discrepancy in correlation co- 
efficients from sample to sample, and its influence should always be 
taken into account in evaluating the size of an obtained correlation. 
Any other factors which serve to increase or decrease the heterogeneity 
of a group may also raise or lower the size of a correlation coefficient. 

It thus follows that any given test has not one but many reliability 
coefficients. The same test will yield a high reliability coefficient in a 
relatively hclerogeneous group, a much lower one in a more 
homogeneous group. A measure which is relatively independent of the 
range of scores is the standard error of measurement In 

A fouith method which has come into prominence in recent years is the 
“method of rational equivalence" developed by Kiidcr and Kichardson (8). It is 
also basically a measuic of internal conMslenc> and therefoie an in(^c\ of the 
adequacy of behavior sampling. Its computation takes into account the intcrcorrela- 
tion of individual test items. 

’^Or the probable erioi of measurement, which can be obtained by multiplying 
‘he standard erroi of measurement by .6745 — .6745a„,onH )■ 
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the computation of this measure, both the reliability coefficient and 
the standard deviation of the scores of a particular group are em- 
ployed. The result will be in the same units as the test scores, and 
indicates the amount of “error” introduced into the score by the use 
of hn imperfect measuring instrument. In interpreting the a,„ons , we 
may say that the chances are approximately 2:1 that the obtained 
score docs not differ by more than the amount of the standard error 
from the individuaPs “true” score, i.e., the score he would have ob- 
tained on a measuring instrument with perfect reliability. To take a 
specific illustration, if a child’s 10 on a particular test is 113 and 
the Oniras of this test is 5 points, then the chances are 2:1 that the 
child’s “true” iO is between lOS and 118 ( 1 1 3 ~ 5 -= 108; 
113-1-5 - 118). 

VALIDITY 

The Concept of Validity. The validity of a psychological test is 
the degree to which the test succeeds in measuring, diagnosing, or 
predicting that area of behavior which it sets out to measure. In 
order to determine such validity, it is necessary to have an independent 
criterion measure of the behavior under consideration. For example, 
in validating a series of tests of musical aptitude, subsequent per- 
formance in music schools was the criterion employed. A test designed 
for the selection of taxi drivers would be validated against actual job 
performance of a typical group of applicants. A scholastic aptitude test 
for college freshmen would he checked against the students’ grades 
in college courses. In all these situations, a representative group, 
selected for validation purposes, is gi cn the test and then followed 
up to determine each individual's actual performance in the areas 
being tested. 

The follow-up thus yields a direct measure of that which the test 
is trying to predict through a small sample of performance. In this 
lies the answer to the apparent paradox of test validity. It might have 
been argued that if we need an independent and thoroughly reliable 
measure of that which the test seeks to measure, then why do we need 
the lest at all? The purpose of the test, however — once it has been 
validated on the trial group — is to predict within a short testing 
period that which would otherwise have been discovered only through 
a long and wasteful period of direct observation. If all applicants for 
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a job were indijicriminatcly hired, the sheep would eventually be 
separated from the goats through actual success or failure on the job. 
Within a period of several years, most of the incompetent workers 
would probably have been eliminated. Such a process of selection 
would obviously be absurdly costly both to employer and employee. 
Similarly, all students who wished to enter medical school might be 
admitted, on the undoubtedly correct assumption that those lacking 
the proper qualifications would eventually “flunk out.” It is the primary 
object of psychological tests to approximate, in advance, the type of 
judgment which would otherwise require a virtually prohibitive trial 
period. Once the validity of a test has been established, there is, of 
course, no longer any need for a criterion measure on the subjects 
with whom the test will be used. 

The validity of a testes most commonly reported in terms of the 
validity coefficient, i.e., the correlation between test scores and cri- 
terion measures in the validation group. All the factors which may 
alfect the size of a correlation coclTicient should obviously be taken 
into account in evaluating such a validity coelficient. Thus the size 
and nature of the group upon whom the correlation was computed 
must be ascertained. A validity coefficient obtained on a small itumber 
of subjects is of little significance, since the correlation may vary 
widely when a difl'erent sampling is employed. The type of subjects 
upon whom validity was determined should be similar to those on 
svhom the test is used. It is now a growing practice to redetermine the 
validity of published tests on the population with which the test is to 
be used. Not only norms, but also the validity of a lest, arc specific 
to the population. A test may be a good measure of intelligence for 
machinists and a poor one for college freshmen. Or the same test 
may actually prove to be a sati.sfactory test of a difl'erent function 
when applied to a group unlike the original validation group. 

Also relevant is the consideration of the range of ability repre- 
sented by the group upon whom the validity coeflicient is computed. 
This is the same problem which was discussed in connection with 
the reliability coelficient and it can be met by a similar solution. In 
place of the standard error of measurement, wc can now computeThc 
standard error of estimate (a.,,! ). This is a measure of the amount of 

U- For example, a well-known lest of clerical aplilLide proved to be a i oor instru- 
ment for the selection of olUce clerks in a paiticuku company, but was successful as 
a means of selecting a certain type (^f routine factory worker. 

As usual, this can also be cxpiessed as a piubahlc cno}: — .6745 
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error which is introduced by virtue of the fact that the individual’s 
performance is predicted or estimated from the test score rather than 
being directly measured through the criterion. The is based upon 
the SD of the criterion measure and the validity coedieient of the test. 
It is expressed in the units in which the criterion is measured. For 
example, if scholastic grades are to be predicted and if such grades are 
expressed on a percentage basis, then the a, .si will be in percentage 
units. Thus if a student’s predictecl grade on such a scale is 78 and 
the t^f the test from which the prediction was made is 4, then the 
chances arc 2:1 that the student’s actual grade will lie between 74 
and 82 (78 ~ 4 - 74; 78 -|-4 -- 82). 

Ill a number of situations, tests are used not to predict the exact 
performance or relative position of each individual, but rather to 
select those individuals who arc most likely to reach or exceed a 
certain minimum standard of achievement. 1Tis is particularly true 
of the industrial use oT tests, in which the test serves as a '^screening' 
device, and '*cnt-off scores" are often of more interest than validity 
coehicients. The validity of a test can be most clearly determined in 
such situations by the method of contrasted groups. For example, a 
group (T applicants who have taken the test under consideration are 
hired and followed up for a year or longer. At the end of this trial 
period, each individual is classified as a ‘"success” or “failure” in 
terms of whatever practical standard the company employs for 
evaluating job performance. The initial test scores of these two con- 
trasted groups arc then compared: the greater the difference in such 
scores between the two groups, the greater the validity of the test for 
the purpose of selecting successful applicants. From an examination 
of the distribution of scores made b> the two groups, a “cut-off” 
point or minimum score is set at a point which would eliminate the 
largest possible proportion of “failures’' and at the same time exclude 
the smallest possible percentage of “successes.” 

Illustrations of test validation taken from the military use of tests 
in World War II arc shown in Figures I, 2, and 3, the first two dealing 
with the validity of the AGCT and the ;Iiird with the validity of a 
special battery of tests designed for the selection of pilots. Figure 1 

Knowing the validity coefficient of a test, the per cent of applicants who will 
need to be hired, and the per cent of “successes” on the particular job prior to the 
use of the test, ii is possible to compute the per cent of those selected through the 
lest who will succeed (cf, 12), In this way, the net gain in selection accuracy 
attributable to the use of the test can be detei mined. 
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indicates the extent to which the AGCT predicted the success of 
officer candidates, the criterion being the actual commissioning of 
the men at the completion of their officer training course. The data 



Fig. 1. Predicting Success of Officer Candidates from their Scores on 
the Army General Classification Test. (From Boring, 3, p. 242.) 

of 5520 officer candidates from 14 schools reveal a fairly close degree 
of correlation between criterion and test score. Thus of the men 
scoring 140 or higher on the AGCT, over 90% received their com- 
mission. At the other extreme, less than 50% of those scoring below 
110 succeeded in obtaining the commission, although they had gone 
through tlie same training course. Figure 1 shows very clearly the 
reason for setting 110 as the cut-off point for subsequent admissions 
to officer training courses. While over half of those scoring below 
110 failed, the number of failures dropped sharply to less than a 
quarter in the group scoring 110-119. 

Similar data for tank mechanics are given In Figure 2. Grades 
obtained in a tank mechanics course constituted the criterion for 
this group. Subdividing the group into the five Army Grades on the 
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Fig. 2. Predicting Success in a Tank Mechanics Course from AGCT 
Score and from Schooling. (From Boring, 3, p. 251.) 

AGCT, we find an even closer correspondence between test score and 
criterion than was found in the ofiic'T group, a fact which is un- 
doubtedly attributable in part to the greater heterogeneity of the 
tank mechanics. It will be noted that among the men scoring below 
70 on the AGCT, less than 10% received an above-average grade in 
the tank mechanics course. This percentage rises in each successive 
Army Grade until we find over 80% of the men in Grade I receiving 
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an above-average grade in the course. For comparison, the data on 
schooling have been checked against the same criterion of success in 
the tank mechanics course, with the results shown in section B of 
Figure 2. Some correspondence exists here also: among the men who 
had completed only six grades in school, less than 30% did better- 



Fig. 3. Validity of a Pilot Selection Battery. (From Flanagan, 5, p. 58.) 


lhan-average work in the tank mechanics course, this percentage 
rising slowly but consistently until it reaches slightly more than 60% 
in the group that had had two years of college training. It is apparent, 
however, that success in the lank mechanics course correlated more 
closely with AGCT score than with schooling. 

The data reproduced in Figure 3 show the effectiveness of a series 
of tests used by the Army Air Forces in the selection of pilots. The 
object of the tests was to predict pilot success prior to any flying 
training. The major criterion employed for validation was completion 
of primary pilot training, although some corroborative data on sub- 
sequent performance in advanced training was also obtained. Through 
an initial analysis of the requirements of the pilot’s job, together with 
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preliminary follow-up studies, a battery of tests was assembled, which 
consisted of 6 apparatus tests of coordination and speed of decision, 
and 14 papcr-and-pencil tests of intellectual abilities, perception and 
visualization, and certain personality characteristics. The scores on 
each of these tests were weighted so as to give the best possible 
estimate of the criterion. The sum of the weighted scores was expressed 
in terms of a nine-point scale. The term “staninc” (standard nines) 
was coined for the units on this scale, a stanine of 1 representing the 
lowest level of estimated pilot aptitude and a stanine of 9 the highest. 
Reference to Figure 3 shows the degree to which this battery suc- 
ceeded in separating the successes from the failures in primary pilot 
training, in a group of 185,367 men. The horizontal bars indicate the 
proportion of men in each stanine who were eliminated because of 
flying deficiency, fear, or at their own request, ft will be seen that 
only 4% of those in the top stanine were eliminated, in contrast to 
11% in the lowest stanine. The per cent of failures follows a uniform 
progression between these two extreme groups. As a result of such 
investigations, the cut-off score for admission to pilot training-” was 
subsequently set at pilot stanine 7. 

The Validation of General Intelligence Tests, The validation of 
special aptitude tests is a relatively clear-cut procedure since the cri- 
terion is usually definable in fairly specific terms. Such is also the 
case when validating general intelligence tests for use as instruments 
of preliminary classification or as part of a battery for the prediction 
of performance in a particular occupation, course of study, and the 
like. Most intelligence tests, however, are still constructed for the 
relatively general and vaguely defined purpose of measuring ‘'intel- 
ligence,'’ without reference to any sp cific situation. The problem of 
finding a suitable criterion for sucu tests presents mjrc serious 
difliculties. 

A number of special criteria have been developed for the validation 
ot tests of general intcHigence, more than one criterion frequently 


“Sianincs" arc normalized sUindard scor s, or “T-scoies'’ (cf. footnote 9). The 
lov'csi 4^, of ilic dii'.ribution icccivcil a Manint . f T, the next 7';. 7, the follov^ing 
12". , 3; the next 17',: , 4, and the lollowing 20S' , 5, which c^'iiespondcd to the mean 
of the distribution. Corrcspondinii percentai»es above the center of the distribution 
( 17, 12, 8, 4 I were assigned slamncs of 6 to 9, lespectivcly. 

Similai battencs were constiiKlcd to yieU! slanmcs for navigatois and other 
AAF gioups The specifk'ily of aptitudes was v«\ idly demonstiatcd by the observation 
that men with high navigatoi siaiimcs, lor example, often had pilot slanines as low 
as 3, 4, or 
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being applied in the evaluation of a single test. For age scales such as 
the Stanford-Binet, as well as for preschool and infant tests, age 
differentiation is a major criterion. The degree to which the test dis- 
criminates between successive age groups, as well as the correlation 
between chronological age and score, are taken as indices of validity. 
It should be noted that the satisfaction of such a criterion indicates 
merely that the test measures behavior characteristics which tend to 
increase with age under existing conditions and in the type of 
environment in which the test is standardized. 

A second, commonly used criterion consists of teachers’ ratings, 
school grades, or other indications of quality of academic achievement. 
It is because so many current intelligence scales have been validated 
chiefly against school achievement that they are frequently described 
as tests of scholastic aptitude. Ratings by supervisors, such as em- 
ployers, shop foremen, army officers, and the like often serve as criteria 
for adult tests. Amount of education is sometimes introduced as a 
criterion on the assumption that the successive rungs of the educa- 
tional ladder serve as selective factors, progressively eliminating those 
less able to profit from the more advanced types of instruction. With 
an extensive system of public education, such as is available in 
America, this assumption is partly correct, although at the higher 
educational levels other factors besides ability undoubtedly affect 
survival. 

Various applications of the method of contrasted groups are also to 
be found. The comparison of the scores of persons in different 
occupational levels is an example of such a method. Another illustra- 
tion is the comparison of the scores of unselected school children with 
those of institutionalized feebleminded subjects of the same age. In 
these instances, the criterion is ultimately based upon the com- 
posite demands of everyday life situations which determine survival 
in various occupations or in a normal, non-institutional environment. 
A closely related vivlidating technique is to correlate test scores 
with psychiatrists’ diagnoses as to whether or not the individual should 
be institutionalized for mental deficiency. Unless such diagnoses are 
based upon a prolonged observation period, however, this criterion 

2' By means of the biscrial coefficient of correlation, since the diagnoses are in a 
twofold category and the test scores in a continuous distribution. Such correlations 
have been repotted, e.g., in the validation of the Wechslei -Bellevue Intelligence Scale. 
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may itself be no more valid than the test and tlius serve no purpose 
in the process of validation. 

Frequently, correlations with other intelligence tests are reported 
as validity coefficients. The Stanford-Binet, for example, has often 
sci^ved as such a criterion. This procedure is justified only when the 
new test is a short and relatively crude instrument introduced as a 
practical time-saving device. It obviously assumes that the new test 
can do no better than approximate the results of the earlier test. For 
some tests, however, such correlations are reported not so much as 
validity coefficients but simply as a rough indication that the new 
lest is measuring approximately the same general area of behavior 
as other current tests dcsignaled by the same name. In such cases the 
logic of the situation demands that the inter-test correlation be 
moderately high, but not too high. An unduly high correlation be- 
tween SLicn tests would indicate needless duplication of effort, since if 
the two tests are so nearly the same, there is no point in introducing 
the second one. 

Finally, vc may consider the method of internal consistency 
whereby the total score in the test itself is used as a criterion. Indi- 
vidual items for a test are often chosen on the basis of their agree- 
ment with the total score. For example, items on the Stanford-Binet 
were selected in part by comparing performance on each item with 
10 on the entire scale. If, for instance, a given item was passed by 
approximately the same per cent of subjects in the lower and upper 
10 levels, it obviously was failing to discriminate between individuals 
who dilTered in those characi rislics measured by the test as a whole. 
On this basis, such an item would be eliminated from the scale. The 
degree to which the items finally selected correlate with the total 
score may then be cited as evidence « f validity. Another application 
cf this method is the selection and validation of the separate tests in 
a battery, in terms of the correlation of each test with the composite 
score on the entire battery. 

The method of internal consistency falls in the borderland between 
validity and reliability. In so far as it does not depend upon an out- 
side criterion, it does not, strictly speaking, yield a measure of 
validity. Jt may be argued, however, that any index of the adequacy 
with which a given behavior area is sampled is relevant to the concept 
of validity. It is certainly true that a test cannot be very valid if its 
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items do not adequately sample any one behavior characteristic, i.e., 
if the subject's performance is inconsistent from item to item and his 
score varies widely with the addition or deletion of a few items. On 
the other hand, although a lest may measure a particular behavior 
area with a high degree of consistency, and may have sampled it very 
fully, the behavior tested may not be that which the test purports 
to measure. It would be quite possible, for example, to devise a test 
which showed a high degree of internal consistency, but which did not 
differentiate between normal and feebleminded subjects. If such a 
test had been labeled an ‘'intelligence test,” it obviously would not be 
valid, despite its high internal consistency. This technique is thus of 
value only when used in conjunction with further validation by out- 
side criteria. 

The Validation of Personality Tests. The validation of personality 
tests presents even more of a problem than does the validation of 
intelligence tests because of the difficulty of finding a satisfactory 
independent criterion of most personality characteristics. A number 
of the techniques employed with intelligence tests have, however, 
been adapted for use in the validation of personality tests, with a 
moderate degree of success. One such technique is the corrclatfon of 
test scores with ratings by associates, teachers, job supervisors, and 
others who may have had an opportunity to observe the subject over 
an adequate period of time. In general, such criterion ratings for per- 
sonality characteristics should be made by more than one observer, 
in order to guard against individual bias and other idiosyncrasies of 
the raters. Similarly, care should be taken to insure that the raters 
have had ‘‘trait acquaintance,” i.e., that they have had the opportunity 
to observe the subjects in those specific aspects of behavior which 
are covered by the test. Correlation with psychiatric diagnosis has 
been employed in validating certain tests of emotional maladjustment. 
As in the case of inielligencc tests discussed cibove, such a procedure 
is satisfactory only when the criterion itself is based upon a careful 
and prolonged follow-up, rather than upon a cursory psychiatric 
examination which may be no better than the test being validated. 

Correlations with other personality tests have sometimes been re- 
ported as an index of validity. This again presupposes that the cri- 
terion itself has previously been established as valid. It is best adapted 
to the validation of tests which are introduced as abridged, lime- 
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saving versions of longer tests. For example, the Bernreuter Per- 
sonalily Inventory was originally designed to yield four separate 
scores within a 1 5- 2C)-minute testing period, each of which w^as an 
approximation of the score on a dilTerent, previously constructed test. 
Tlirough this test a rough estimate was obtained of the subject's score 
in neurolicism, introversion, dominance, and seif-sulhcicncy. Obvi- 
ously in this case the correlations of the Bernreuter scores with the 
scores on the four separate tests from which it was derived w^ould be 
relevant. 

I he method of contrasted groups is one of the most c<')mmon ways 
of checking the validity of a personality test. For example, occupa- 
tioiMl group may be the criterion, as when a test of extroversion or 
sociabilitv is given to, let us say, salesmen and mechanics. If the 
scores of ihc former group are clearly higher than those of the latter, 
some evidence will thereby iiave been furnished for the validity of 
the test. Delinquent and non-delinquent children have occasionally 
been used in a similar way to test the validity of certain character 
tests. Or the scores made by neurotics under treatment can be com- 
pared with those of a matched group of normal persons who have 
never been under psychiatric care. In connection with tests of 
neurolicism or cmolional insiability, the frequency of a response in a 
normal group is a further check. 1 hus if a particular behavior 
characteristic occurs in a largi* per cent of normal persons, it cannot 
by definition be an '‘abnormal" response. 

A relatively large numixr of personality tests, especially those 
of the questionnaire type, have relied exclusively or primarily upon 
the method of internal ciuisistenc y. For example, the 25% most 
introverted and the 259f most extro\/rled subjects in the validation 
group arc first selected on the basis of their total scores on a pre- 
liminary form of an introversion-extroversion test. The responses of 
the two groups on each item arc then compared. If a supposedly 
"introverted'’ behavior item (xxurs more often among the extroverted 
group than among the introverted, then such an item is discarded as 
not being properly diagnostic. If it ocea* ^ with about equal frequency 
in both groups, it is neutral or irrelevant and should likewise be dis- 
carded. To be retained, an '‘introvert” item must occur with a sig- 
nificantly higher frequency in the introverted than in the extroverted 
group. This method is subject to the same limitations discussed in 
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connection with its use in the validation of intelligence tests. The fact 
that it has often been the only method for checking the validity of per- 
sonality questionnaires has led to considerable skepticism regarding 
the behavior characteristics which these tests were actually measuring. 

THE QUESTION OF “CAPACITIES” 

In closing this brief survey of some of the major problems of psycho- 
iogical testing, a word should be added regarding the relationship of 
tests to the commonly misused concept of “capacity.” The original 
aim of the mental testers was the measurement of the individual’s 
“capacities,” or “potentialities,” of behavior development, as dis- 
tinguished from his present skills and information. The measurement 
of the latter would have been a relatively simple task. If we want to 
ascertain whether an individual is proficient in many languages, for 
example, we need only to examine his knowledge of all languages 
with which he claims familiarity. But if we want to know whether this 
individual can learn languages easily, whether it would be worth the 
efiort to teach him, or whether he should consider a vocation which 
demands a mastery of several languages, then we arc laced with a 
much more dilhcult problem. This is the type of problem with which 
mental testers ha\c tried to cope. 

If one is to determine what the individual can do rather than what 
he has already accomplished, it has been argued, it is necessary to 
“rule out” in some way the differences in formal or specialized train- 
ing among diflerent individuals. This is usually attempted either by 
presenting material which is equally unlamiliar to all or by the 
reverse procedure of utilizing only material common to everyone’s 
experience. Frequently the two methods may be combined in different 
items of a te^t, or even in the same item, as in the use of familiar 
material in a novel and unusual manner. 

Such a procedure is a practicable one and will yield usable informa- 
tion, provided that due cognizance is taken of its assumptions and 
limitations. In the use of cither “familiar” or “unfamiliar” material, 
it is necessary to ascertain whether the material is actually familiar 
(or unfamiliar), to an approximately equal degree, to all the subjects 
being tested. When given to persons from dilferent national or cultural 
groups, or from widely differing economic, social, or educational back- 
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grounds, psychological tests do little more than reflect the varied back- 
grounds of the subjects. 

No psychological test has any mysterious power in itself whereby 
it can strip the subject's behavior of the accumulated effects of his 
rcactional biography, and reveal his original, carefully insulated 
“potentialities.” In mental testing, the terms “potentiality” and “ca- 
pacity” can be used meaningfully only in the sense of prediction of 
subsequent behavior from present behavior. The prediction may like- 
wise cover a wider range of behavior than that included in the test, if 
such a prediction is proved to be valid. But the starting point of such 
predictions is always present behavior, not anything projected back 
inti) some hypothetical pristine state. 
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Pr;PlJLAR OPINFON FREQUPNTT.Y CLASSIFIES PEOPLE in Tcfcrcnce tO the 
possession or non-possession of certain traits. Thus one individual is 
said to have a talent for music, another for painting, a third for 
mathematics, a fourth for organizing people. wSuch a characterization, 
however, results from purely practical considei'alions. In order to 
choose music as a vocation, or even as a serious avocation, for ex- 
ample, an individual must have a certain minimum of musical talent; 
if his degree of musical ability falls below that minimum, he is not 
regarded as “a musical person.” Moreover, in our society we are 
accustomed to characterizing the individual in terms of his outstanding 
assets and liabilities, and simply ignoring the traits in ^^hich he rates 
close to the norm. Hence we label Mr, Jones a violinist, Miss Smith a 
skater, and Mr. Doe a thief. We do not ordinarily characterize Mr. 
Doe as a mediocre skater. Miss Smith as a relatively poor violinist, 
or Mr. Jones as an “averagv honest man!” ^ Qualitative distinctions of 
this sort are made in practice and arc based on arbitrary or socially 
determined criteria or limits. 

Actually, however, every individur! can be described along a con- 
tinuous scale in any bciiavior category. In other words, individuals 
do not fail into sharply divided types, individual dilTcrcnccs are rather 
a matter of decree. It is in this sense that individual differences are said 
to be quantitative rather than qualitative. 

To be sure, it might be argued tha^ ^hcre are certain characteristics 
Vv'hich a person may either have or noc have, and that in this respect 
we may speak of qualitative dilTercnccs. The classical examples are 
such sensory handicaps as loss of vision or hearing. Here, it would 

^ The very fact that wr hnvc :i word foi “Ihicl” but no wo^d tor ‘‘avciage honest 
man'’ is a further jlluslrauon of the same point. 
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seem, are traits characterized by presence or absence: a person can 
see or he cannot see, he can hear or he cannot hear. This, too, turns 
out to be a purely conventional and practical distinction. Anyone who 
has visited a school for the blind knows that there are many degrqps 
of blindness, and that not all those classified as blind are totally 
blind. The everyday working definition of blindness is any degree of 
visual deficiency too serious to permit normal activity. The same is 
obviously true of deafness and any other sensory disorder. Between 
the empirically established ‘"normal” vision or hearing and what is 
classed as blindness or deafness there is to be found a continuous 
gradation of minor deficiencies. It should be added that the existence 
of a trait in zero degree, as in total blindness, is not inconsistent with 
the quantitative view of individual differences. The latter imnlies only 
that there be intermediate degrees rather than simple presence or 
absence. 

THE DISTRIBUTION OF INDIVIDUAL DIFFERENC ES 

Since individual differences arc quantitative in the above ^sense, 
we may now ask how the varying degrees of each trait arc distributed 
among people. Are individuals scattered uniformly over the entire 
range or do they cluster at one or more points? What are the relative 

TABLE \ Frequency Distribution of Scores of 1000 College Students on 
a Simple Learning Test 

( I'rt)!!) Aii.ist.i''! 3, p i 1 


( lass-ItUd at 

/ trqucnc' 

- - - 

- - - - 

52-55 

) 

48-5 1 

1 

44-47 

20 

40-43 

73 

36-39 

156 

32-35 

328 

28-31 

244 

24-27 

136 

20-23 

28 

16-19 

8 

12-15 

3 

8-11 

2 

N -- lOOO 



feature and Extent of Individual Differences 61 


frequencies with which different degrees of a trait occur? These ques- 
tions can best be answered by an examination of frequency distribu- 
tions and frequency graphs. 

Like all statistical devices, the frequency distribution is a means of 
sitmmarizing and organizing quantitative facts in order to facilitate 
their treatment and reveal significant trends. Scores on a test, or any 
other set of measures, are grouped into class-intervals, and the num- 
ber of cases falling within each interval is tabulated. An example of 
a frequency distribution is given in I’able 1. This shows the scores of 
1000 college students on a simple learning test. The scores range 
from 8 to 52 and have been grouped into class-intervals of four 
pr>ints. The advantages of such a table over a list of 1000 individual 
scores arc obvious. 



• Scores 

Fig. 4. Distribution Curves: Frequency Polygoji and Histogram. (Data 
from Table 1.) 


The facts brought out by a frequency distribution can be made 
more vivid if presented pictorially by means of a frequency graph. 
Jn Figure 4 are shown the data of Table 1 in graphic form. The base 
line or horizontal axis represents the scores; the vertical axis shows 
the frequency or number of cases falling within each class-interval. 
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The graph has been plotted in two ways, both being about equally 
common. One graph is a frequency polygon, in which the number of 
individuals within each interval is indicated by a point, centrally 
located in respect to the class-interval; the successive points are then 
joined by straight lines. The other graph is obtained by erecting*" a 
column or rectangle over each class-interval, the height of the column 
depending upon the number of cases in that interval. This type of 
graph is known as a histogram, 

THE NORMAL CURVE 

The reader will already have noticed certain characteristics of the 
distribution presented in Table 1 and Figure 4. The majority of cases 
cluster in the center of the range and as the extremes are approached 
there is a gradual and continuous tapering off. The curve shows no 
gaps or breaks: no clearly separated classes can be discerned. The 
curve is also bilaterally symmetrical, that is, if it should be divided by 
a vertical line through tlic center, the two halves so obtained would 
be nearly identical. This distribution curve resembles the bell-shaped 
‘'normal curve,” the type most commonly found in the measutt^nient 
of individual differences. The theoretically determined, ideal normal 
curve is illustrated by the graph reproduced in Figure 5. 

The concept of the normal 
curve is an old One in statistics. 
It first became familiar as the 
noniKil probability curve. The 
probabililv of the occurrence of 
an event is the expected relative 
Scores frequency of occurrence of the 

Fig. 5. Thcorclical Normal Curve, given event in a very large, or 

^ infinite, number of observa- 
tions. This probability is represented by a ratio or fraction, the 
numerator of which is the expected outcome, and the denominator 
the total possible outcomes. Lhus the probability or chances that when 
two coins arc tossed only heads will come up is W, or one out of four 
possible occurrences,- the probability of one head and one tail is Vz ; 
and that of two tails, If the number of coins is increased, say 

^ This follows from the fact that the only possible combinations of heads (H) 
and tails (T) which can occur when two coins fall arc the following four: HH, HT, 
TH, XT. Just one ot these lour (HID contains only heads. 
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to 100, so that the number of possible occurrences or combinations 
becomes very large, we can still determine mathematically the chances 
of any one combination, such as all heads or twenty heads and eighty 
tails, occurring. These probabilities, or expected frequencies of 
occurrence, can be plotted graphically by the same method outlined 
above for plotting scores. The curve obtained when the number of 
cojns is very large will be the bell-shaped normal probability curve. 
In Figure 6 are shown the theoretical and obtained frequencies for 12 



Fig. 6. The(’>ictical aiul Ohscjved Disli ibulions i J Results in 4096 Throws 
ol 12 Dice. (Data from Yule and Kendall, 35, p. 424.) 

dice thrown 4096 times. In each throw, the number of dice showing 
a 4. 5. or 6 spot uppermost was determined. This number Lould, of 
course, vary from zero to ’2, I’-e total number of dice thrown. The 
graph shows the relative frequency of each combination in the total 
4096 throws. It will be noted that there is a very close agreement 
between the theoretical and obtaineo curves. 

The results obtained by tossing coins or throwing dice are said to 
depend upon ‘'’chance.” By this is meant that the outcome is deter- 
mined by a large number of similar, equal, and independent factors. 
ITe height from which a coin or die is thrown, its weight and size, 
the twist of the hand employed, and many similar conditions deter- 
mine which particular face will fall cppermost. Likewise, a person’s 
height, or weight, or performance on an intelligence test can be re- 
garded as depending upon a variety of independent factors, each 
having about equal influence upon the result. Thus it has been sug- 
gested that the operation of chance is responsible tor the distribution 
of human traits according to the normal frequency curve. It does not 
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follow, however, that if a characteristic is normally distributed, it is 
necessarily the result ol “chance factors” as defined above. 

flic normal curve also appears in a different situation as the curve 
of enor When repeated measurements are made, the results will not 
be identical on successive occasions Such fluctuations, or “errois,” 
are present to a greater or lesser degiee m all types of measurement 
The length of a table as measured by a meterslick, the speed of a 
simple movement, or the ^.esthetic appeal ol a work of art will not 
remain the same on repeated observations If a verv large number of 
observations of the same object or phenomenon are made, and the 
lesults found on successive occasions arc plotted in a frequency 
graph, a normal curve will be obtained The errors of observation or 
measurement which produce the variation are themselves the result 
of chance factors, and hence the curve of error, like the distribution 
curve, will approximate the normal probability curve 

OTHER TVPFS Ol DISTRIBUTION CLRNES 
AND WHAI THEY MIAN 

The implications of the normal distribution curve for a ps\choloi:> 
ol individual dilTcrenccs can be realized more \ividl> by contrasting 
this form of distribution with other possible types The distributions 
chosen in particular for this comparison are those implied by certain 
common theories and beliefs in regard to individual differences 
They are also occasional!) found with actual test results because 

of the use ol faulty techniques 
or the operation ol special fac- 
tors 

A skewed disti ibution is one in 
which the peak or “mode ’ of the 
curve IS displaced to either side of 
the center Such a distribution 
Fig. 7. A Skewed Distnbutu i lacks the bilateral symmetry of the 

normal curve In Figure 7 will 
be found an illustration of a skewed curve, with a piling up of scores 
at the upper end ol the distribution Such a distribution is implicit 
in the popular conception of many char icter traits Thus the ma|ority 
of people are considered “honest” and are piled up at one extreme 
of the scale; from this point, the number of cases is believed to de- 
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crease steadily as the opposite extreme is approached. As will be 
illustrated in a later section of the present chapter, this type of 
distribution is not ordinarily found when adequate measures of 
character traits are used, i.e., measures which are capable of differen- 
tiating degrees of response. 

In a number of behavior characteristics indicative of social con- 
formity, a type of distri|pution known as the J-curve is often found. 
This curve, named after its resemblance to the letter J, is in reality 
a highly skewed curve, with the majority of people falling at that end 
which represents complete or nearly complete conformity. A favorite 
illustration of such J-curves is found in the reactions of motorists or 
pedestrians to various tralhc regulations, such as stopping for traffic 
lights, stopping at intersections, or driving within the proper traffic 
lane An example of such a curve is reproduced in Figure 15. Other 
illustrations of “conforming behavior" to which the J-curve has been 
applied include observations of religious practices, such as time of 
arrival at services, participation in group singing, amount of kneeling, 
and the like. 


A type of distribution not so frequently found as the skewed curve 
but nevertheless assumed in certain common practices is the rectangu- 
lar distribution, illustrated in Figure 8. If individual differences were 

distributed in this manner, it would 



Fig. 8, A Rectangular 
Distribution. 


mean that there were as many gen- 
iuses and idiots as mediocre peo- 
ph , as many men whose height is 
6 feet 6 inches asffiiose whose height 
is 5 feet 8 inches. It is interest- 
ing to >peculate on the effect which 


such a situation would have on our 


sense of values. Our thinking is so permeated with the knov/iedge 
that extreme degrees c'f a trait arc relatively infrequent, that it is 
difficult even to conceive of a world in which extremity did not imply 
rarity. 

The assumption of a rectangular disoibution of traits is implicit 
in certain common misuses of percentile scores. In the percentile 
system of scoring, it will be recalled, the subject’s standing on any 
test is expressed in terms of the percentage of people in a given 
group whose scores he excels. When comparing individuals who 
receive, let us say, percentile scores of 90, 80, 60, and 50, we must 
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bear in mind that the difference in ability between the first two cases 
is greater than that between the last two, although in both pairs the 
difference is 10 percentile points. In order to include the 10% of the 
cases which fall between the 90th and 8()th percentiles, we must cover 
a much longer distance on the base line of the normal curve than is 
necessary in going from the 5()th to the bOlh percentiles. This results 
from the greater clustering of individuals near the center of the curve, 
and the relatively small number ol cases at the extremes. Only if the 
trait disirihution were rectangular would suett si\e percentile scores 
represent equal units of ability. This does not mean that percentile 
scores are of no value. Like mental ages, they furnish a simple and 
vivid means of expressing the subject’s shmding on a test. Such devices 
do not, however, furnish an equal unit scale of ability. Neither per- 
centiles nor mental ages, lor example, lend themselves to averaging 
or to similar arithmetic operations, because ol such inequality of 
units. 

Lastly. speci<il mention should be made of the nudtiniodal distribu- 
tion because of the prominent part it plays in so-called type theories. 
A multimodal curve is one having more than one mode or peak. 
Instead of a single clustering of individuals in the center as in the 
normal curve, or at either extreme as in a skewed curve, the cluster- 
ing occiiis at several points. The peaks ma> be equally large, or there 
may be a major peak and one or more minor ones. The mo^t popular 
variety seems to be the bimodal curve, with two approximately equal 
peaks. All the common schemes of classification which place indi- 
vidu.ils into distinct categories presuppose some form of multimodal 
distribution. The division of men into the genius, the normal, and 
the feebleminded, the sane and the insane, the sociable and the un- 
sociable, all rest upon a tacit assumption that “most people’" can be 
classified clearly into one of these groups, with possibly a few inter- 
mcdialc doubtful cases, it is iiiteicsting to note that these distinctions 
are much less common in the realm of physical traits, where con- 
tinuity of variation is more apparent to live naked eye. 

CONDITIONS WHICH AFFECT THE SHAPE 

OF THE DISTRIBUTION CURVE 

Distribution curves which deviate significantly from normality and 
which exhibit one or more of the properties discussed in the preceding 
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section occur from time to time because of the operation of special 
factors. A consideration of some of the most common of these factors 
is essential for the proper interpretation of frequency curves. The 
conditions which may influence the shape of the distribution curve 
inefude peculiarities of samplin^^ inadequacies of the tests or other 
measures employed, and factors which t)pcratc directly upon the 
distribution ol the behavior itself. Among the kr^i-named type of 
laclors are pathological conditions and socially imposed constraints. 
In the following sections we shall consider each of these various con- 
ditions in turn. 

Sampling. It would, of course, be possible to obtain any conceiv- 
able t\pe of distribution by deliberately choosing subjects to fit the 
pattern, fhere would obviously be no object to such a procedure. 


I — t -- I - - I ■ T h- 

A. Two Groups Plotted bepaiatcly 



13. l^^o woup" Coi.ibincd 


Fig. 9. Skewness Resulting lioni the Comhmatioii of (jroups with Dif- 
feient Means and Vai labihues. 


Similar variations may, however, occur through the operation of selec- 
tive factors which may have been o\erlookcd by the investigator. 
Whenever a curve de\iates significantly from normality, the adequacy 
of the sampling ought therefore to be examined 

Skewmess may result, for example, fn n; ^hc inclusion within a single 
distribution of two normally distributed groups which differ pro- 
nouncedly in both average and variability. This effect is illustrated in 
Figure 9. In Graph A arc given the separate distribution curves of 
the two groups, one of w hich has a low^ei average as w'cll as a narrower 
scatter of scores than the other. Graph B shows the definitely skewed 
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curve which is obtained when both groups are combined and plotted 
as one distribution. 

A multimodal curve can also be obtained if the sampling tested is 
not chosen at random from the general population, but consists of 
individuals selected from widely differing levels and combined into a 
single group. A group consisting of 5-year-olds and 10-year-olds, for 
example, would present a definitely bimodal distribution in intelligence 
test scores, as well as in height, weight, and many other characteristics. 
Were the intervening age groups from 6 to 9 to be included in this 
sampling, the distribution would take on the appearance of the normal 
bell-shaped curve. 

The production of a bimodal distribution by combining two curves 
of widely separated groups is illustrated in Figure 10. It will be 




Fig. 10. Bimodality Resulting from the Combination of Two Groups 
with Widely Varying Means, 


noted that the overlapping between the two groups is very slight. When 
the overlapping is large, as in the case of adjacent age groups, the 
resulting combined curve will be normal and unimodal. An example of 
a bimodal curve pk')ttcd with actual scores is presented in Figure 11. 
The two distributions which are combined in this curve consist 
of the Army Alpha scores obtained by two groups in the United 
States Army during World War 1. The lower group includes 2773 
native-born white soldiers who had reached no higher than the fourth 
elementary grade when they left school; the upper group consists of 
3954 officers who had had four years of college work. The combined 
curve exhibits the definite bimodality which would be expected. 

Other peculiarities which may result from sampling include exces- 
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sive flatness of the distribution curve (approximating a rectangular 
distribution), or its reverse, excessive peakedness. The latter might 
occur, for example, if the sampling is exceptionally homogeneous. 
Fipplly, it should be noted that an unlimited number of minor irregu- 



Alpha Scores 


Fig. 11. A Bimodal Distribution Obtained by Combining Extreme 
Groups: Alpha Scores of 2773 Soldiers with 4th Grade Education and 
3954 OlTicers with 4 Years of ^"ollcce. (Data from Yerkes, 34, pp. 773, 
777.) 

larities and variations in distribution curves may occur through the 
use of small groups. Cur>cs plotted from a small number of cases 
usually present an uneven, jagged appearance, since individual excep- 
tions loom relatively large. In general, the larger the sampling, the 
“smoother” will be the distribution curve. 

Inadequacy of the Testing Range. If the range of difficulty covered 
by the test items is restricted at the upp.i or lower levels, a skewed 
curve may be artificially produced. Such a distribution will be ob- 
tained when any test is given to a group for which it is not suited. 
Thus if the National Intelligence Test, which is adapted to grades 3 
to 8, were administered to a college class, the large majority of 
subjects would score very near the maximum, and the number of 
cases would decrease rapidly toward the lower scores. Similarly, if 
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one of the many tests constructed for use on college freshmen were 
given to elementary school children, there would be a marked piling of 
scores near the zero end of the scale, and the distribution would be 
equally asymmetrical. 

Obviously these data could not be taken to mean that intelligence 
is not normally distributed among school children or college students. 
Such skewed distributions result from the fact that the difficulty range 
of the test does not extend far enough in the upper or lower direction. 
In the one case, all of those subjects who have more than a certain 



Fig. 12. I he EfTecl of Restricted Testing Range upon the Form of the 
Di^trlbulion C'urve. 


minimum of the ability tested will make a perfect or nearly perfect 
score, whereas if the test luid included more difficult items, these 
subjects would have scattered over a wide range. 1 his is illustrated 
in Figure 12, the solid line showing the actual distribution of ability 
in the group, and the broken line the curve which would result from 
the use of a test with a low ‘ ceiling.” In a similar manner, a piling up 
of zero or very low scores will occur when the test is too difficult 
for the group. In choosing a test for a given group, therefore, care 
must be taken to insure that the subjects have sufficient leeway at 
both ends of the scale. The highest and lowest scores obtained should 
be a considerable distance from zero and perfect scores, respectively. 
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Inequality of Test Units. It can readily be demonstrated that in- 
equality of units in the measuring instrument can distort a frequency 
distribution in various ways, A good illustration is furnished by data 
recently collected on visual acuity by means of two tests (27). The 
fretjuency distributions of the same group of 226 persons on each of 
these tests are shown in Parts A and B of Figure 13, respectively. 
Graph A is a sharply peaked and skewed curve obtained with the 
familiar Snellen chart, in which the subject’s visual acuity index is 




A, Snellen Chart Index B. Equal -Unit Scores of Visual Acuity 

Fig. 13. Distribution of 226 Persons on I wo lests of Visual Acuity. 
(From Tiffin and Wirt, 27, p. 8.) 

based upon the smallest row of letters he can read at a standard dis- 
tance of 20 feet. Thus an individual who at 20 feet can see no 
more than the letters which he average person reads at 50 feet is 
said to have 20/50 vision. Normal vision obviously corresponds to 
a 20/20 index. An index such as 20/1 5 indicates better-than-average 
vision. Because of the particular choic of letter sizes in this test, not 
all acuity levels are sampled to an equal degree, the poorer acuity 
levels being represented by more items than the average or superior 
acuity levels. In other words, the differences in difficulty level between 
successive rows of letters arc not equal; there are larger “gaps” in 
difficulty level in the center and upp-r portions of the acuity scale 
than in the lower portion. 

This inequality of units can be illustrated by a comparison of the 
items or units on the Snellen chart with those on an equal-unit scale 
of visual acuity, as shown below (27, p. 9) : 
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Acuity Scale 
Snellen Chart 


1 2 

3 

4 

5 

6 7 

8 9 

10 11 

12 13 

14 15 

20 20 

20 

20 

20 

20 

20 

20 

20 

20 

200 Too 

70 

50 

40 

30 

25 

20 

15 

13 


The distribution of the scores of the same group of 226 persons on the 
equal-unit acuity test is given in Part B of Figure 13. It will be noted 
thtit this graph approximates a normal curve much more closely than 
does the distribution of unequal-unit scores. 


TABLE 2 Artificial Binwdahty Reiultin^ from Inequality of Units 


Equal 1 'nit Sc me s 

Ti St Score s 

Niinihci of C uses 

15 

1 

1 

16 

o 

3 

17 

3 

7 

18 

4 

32] (29) 

19 


in 

20 

5 

20 

21 

6 

17] (29) 

22 


12( 

23 

7 

7 

24 

8 

3 

25 

9 

1 



100 


Other types of variation from normality of distribution may also 
result from inequality of test units. The artificial production of bi- 
modality may be demonstrated by the highly simplified hypothetical 
example given in Table 2. Let us assume that the entries in column 1 
represent equal units of a given ability and those in column 2 the 
corresponding scores on a nine-item test designed to measure this 
ability. We may further assume for simplicity that the nine test items 
are so steeply graded in difficulty that no subject can succeed with any 
one item if he has failed any previous item. In such a case, the total 
scores in column 2 will correspond exactly to the most difficult item 
which the subject is able to complete. It will be noted that certain 
ability levels (column 1 ) are not represented by test items (column 2) . 
Thus there are no items to correspond to ability levels 19 or 22. The 
third column gives the distribution of 100 subjects in the ability under 
consideration. Obviously the 17 persons falling at ability level 19 do 
not have enough ability to succeed with item 5, which requires ability 
level 20; they will therefore have to stop with item 4 and thus augment 
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the group of 12 persons who have just barely enough ability to com- 
plete item 4. The same will occur in the case of the 12 persons who 
fall on ability level 22. 

'fhe distribution curves of the equal-unit scores and the test scores, 
respectively, are given in Figures 14A and 14B. It will be noted that. 



Fig. 14. Hypothetical Distiibiitions of 100 Persons on an Equal-Unit 
and an Uncqual-Unil Measuie ol the Same Ability. 


although the former is practically a normal curve, a distinct bimodal- 
ity is introduced in the latter by the inequality of units. It is well to 
bea** this cilect in mind when consideiing Tcsults obtained with tests 
of certain personality characteristics, such as introversion-cAtro version. 
Being defined in bipolar terms, such traits may have been sampled 
more thoroughly in their extreme mauilestations, while their inter- 
mediate degrees may have insutlicient coverage. Such a test would thus 
have poorer disci iminative value and larger gaps between units in the 
center of the range than at the extremes, as in our hypothetical illus- 
tration of fable 2. As a result, a slight bimoJality could easily occur 
simply from the peculiarities of the measuring instrument, regardless 
of the distribution of the behavior itscif 

Pathological Conditions. Deviations from normality of distribu- 
tion may result from conditions which affect the development of the 
behavior itseff, rather than from characteristics of the test or of the 
sampling. An example of such an effect is to be found in the distribu- 
tion of lO’s. When the total population is considered, the distribution 
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shows ail excess of extremely low lO’s over what would be expected in 
a normal curve. In one extensive survey conducted in England, for 
example, the proportion of cases with lO’s below 45 was about 18 
times as great as would be expected in a normal distribution with the 
obtained mean and SD " (20). 

The most plausible explanation for such a deviation from normality 
would seem to be that secondary factors, such as disease or pathologi- 
cal conditions, increase the relative proportion of feebleminded per- 
sons (cf., e.g., 14). It will be recalled that a normal distribution will 
be obtained if the variable being measured is the composite result of 
a very large number of independent and equally weighted factors. 
Considering the extremely large number of both hereditary and en- 
vironmental factors which contribute to the development of intelli- 
gence in the general population, it is reasonable to expect lO’s to 
distribute themselves in accc^rdance with the normal curve. If, how- 
ever, any factors should operate with disproportionate weight, then 
the effect on the curve would be equivalent to the use of loaded dice 
in disturbing “chance'’ results. Pathological conditions, which may 
lower the JQ but can never raise it, may be regarded as sucl^ “load- 
ing*’ inlluences. It should be noted that the data concerning tiie lower 
end of the distribution of IQ s, as well as the interpretation of such 
data, are still highly tentative. 'Fbey arc here cited merely as an illus- 
tration of the possible clTcct of pathological conditions upon distribu- 
tion curves. 

Socially Imposed Constraints. Another factor wTiich may “load 
the dice” and alter the distribution of behavior characteristics is to be 
found in socially imposed barriers. Such conditions often produce the 
highly skewed J-curve described in an earlier section. Hie CiTect C)f 
social constraints upon the form of the distribution can be illustrated 
by the behavior of motorists. At an ordinary intersection with no 
IralTic signal, the behavior of drivers will probably lollow^ the normal 
curve, the majority exhibiting a moderate amount of caution, very few 
coming to a full stop, and equally few continuing at the same rate of 
speed with no observation of oncoming tratlic. If, now, red signal lights 
and a policeman arc installed at the intersection, these external con- 
straints will pull the distribution into a J-curve. Figure 15 shows the 
distribution of the responses of 102 motorists at an intersection with 

^Cf. fuller repoit of this invcstigiition on pp. 81 - 82 . 
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no cross traffic approaching, but with red signal lights and a traffic 
officer.^ It will be noted that over 90% came to a full stop. Of the 
remaining small per cent, a few^ slowed down markedly, still fewer 
slov^ed down slightly, and a very small number continued at the same 
speed ( 1 ) . 

It should he noted that the location of the peak depends upon the 
point in the scale at which the so- 
cially imposed behavior falls. The 
extreme or true J -curve is not 
necessarily obtained in all situa- 
tions involving social conformity. 

Thus the degree to which urban 
adults in America partake of al- 
coholic beverages would prob- 
ably show a peak, not at either 
extreme, but at an intermediate 
point corresponding to ‘'moder- 
ate social drinking.'’ This point 
probably represents maximum 
conformity to the praciiccs of 
the group, but it does not rep- 
resent either a maximum or a 
minimum in terms of drinking 
behavior. It is not the J -curve 
itself that is important, but 
rather the fact that variations 
in the distribution cuivc may be 
introduced by social ccjiiform- 
ity. The J-curve is only a spe- 
cial instance of the clTects of 
this type of “loading" factor.'"’ 

Thi^ curve jcscmMcs the letter L more Ilian it dojs Uie letter J, but it has be- 
come conventional to lefcr to all such highly skewed cuivls as J-cuivcs, regardless of 
whether the peak is at ihe extreme ughi or ext-eme lelt. 'the direction of the scale 
could, of course, be aiVauai ily leveised in all sue . ^ases, so that the peak would, be 
at the right. 

^ It has been suggested by some write? s fe.g., 2, pp. 3 32-? ^7) that the normal 
curve may be regarded as tw'o .T-cuives ‘*back-lo-back, so to speak (JLI* The normal 
curve may, ot course, be hiokcn up in an infinite number of ways and conceived 
as a composite ot any number of aibitranly .separated parts. This docs not, however, 
in any way alter l!ie ciidracicrjM c.> of the total distribution, no: the mathematical 
properties of the cuive. 



Slow Slow Speed 

Response at an Intersection with no 
Cross-Traffic but with Red Signal 
Light and Truffic Officer 

Fig. 15. J-( urve of Motorists’ Be- 

havior. (From F, H. Allport, 1, p. 
144.) 
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Fig. 16. Distiibution of Height for Acluh English-Boin Men. (From 
Yule and Kendall, 35, p 95 ) 



Vital Capacity in Cubic Centimeters 

Fig. 17. Vital Capacity ol 1633 Male College Students. (From Hams 
et al., 8, p 94 ) 
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SOME TYPICAL DISTRIBUTIONS 

In Figures 16 to 31 will be found examples of distribution curves 
obtained for a wide variety of human characteristics. These distribu- 
tions were chosen principally because they were based on large, repre- 
sentative samples, most of them including 1000 or more cases. A few 
curves plotted from smaller groups have been included to illustrate 
the distribution of physiological and of certain personality character- 
istics, since in these areas data on large groups are relatively scarce. 

An example of the distribution of a purely structural trait is fur- 
nished in Figure 1 6, which shows 
the ht i^ht in inches of 6194 Eng- 
lish-born men. It will be seen that 
the graph approximates the math- 
ematical normal curve to a re- 
markably close degree. Figure 17 
presents the frequency curve of a 
nu'ic functional, physiological 
trait, vital capacity. This is the 
total volume of air, measured in 
cubic centimeters, that can be ex- 
pelled from the lungs after a 
maximal inspiration. The meas- 
urements from which the curve is 
plotted were made on 1633 
male college students. The gcii- 
eral correspondence to the normal curve is again apparent. 

Figures 1 8 and 19 are concerned with physiological measures which 
are believed to have some relationship lo emotional and personality 
characteristics. The first shows the distribution of 87 children in a 
composite measure of ^'autonomic balance.” High scores in this meas- 
ure indicate a functional predominance of the parasympathetic division 
of the autonomic nervous system; low scores, a functional predomi- 
nance of the sympathetic division. To i-'ychologists, the autonomic 
nervous system has been of special interest because of its role in emo- 
tional behavior. The distributions of 74 children in two different 
indices of muscular tension are shown in Figure 19. 

The two graphs reproduced in Figures 20 and 2 1 illustrate the dis- 
tribution of performance on sensori-molor and simple learning tests. 



Fig. 18. Distribution of Mean Es- 
timates of Autonomic Balance for 
87 C hildrcn between the Ages of 6 
and 12. (From Wenger and Elling- 
ton, 33, p. 252.) 
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Reference may also be made in this connection to the data reported 
previously in Table 1 and Figure 4. All three sets of measures were 
obtained on the same group of 1000 college students. The tests whose 
distributions have been reproduced include cancellation, Pyle symbol- 
digit, and a nonsense-syllable “vocabulary*’ test. In the first, the score 
is the total number of A's in a page of pied type cancelled in one min- 
ute. This is generally regarded as a simple lest of attention and per- 
ception, although speed and control of movement are also involved. 
The symbol-digit test is a simple learning test o"’ the code substitution 
variety. The vocabulary test is a more difficult learning lest, also 



Fig. 19. Dislribulion of Two Measures of Muscular Tension for 74 C’hil- 
dren between the Ages ol 6 and 12. (From Wenger, 32, p. 222.) 


employing a code, which in this case consists of paired nonsense syl- 
lables. The distributions of all three tests fall within the expected values 
of the theoretical normal curve.'* 

Typical results obtained with intclUgcncr tests administered to large 
samplings are presented in Figures 22 to 26. Figure 22 gives tlie dis- 

Mathematical tests of normality were applied to these carves (cf. Anastasi, 
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tribution of the lO’s of 2904 children between the ages of 2 and 1 8 on 
the 1937 revision of the Stanford-Binct. Reference to the graph will 
show that the largest per cent of cases received lO's in the middlemost 
class-interval, from 95 to 104. The per cent tapers off gradually until 
onl}^ a small fraction of 1% is found with IQ's between 35 and 44, 
and between 165 and 174. Institutionalized feebleminded subjects 
were not included in this distribution, the sampling also being restricted 
in certain other ways. Thus the group consisted entirely of American- 
born white subjects, with a somewhat greater proportion of urban 
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Fig. 20. Number of A’s Cancelled in One Minute by 1000 College 
Students. (From Anastasi, 3. p. .t2.) 

residents than is found in the total population of the country. The 
major portion of the sampling was composed of elementary school 
children, an effort having been made to ser^ue groups at the younger 
and older ages which were roughly comparable to the elementary 
school population. It might be noted that the range of IQ’s for the 
total population, as determined from the data of various investigators, 
actually extends from nearly zero to slightly over 200. 

Distributions of scores on group tests, obtained with children as 




80 Differential Psychology 



19 29 39 49 59 69 79 89 99 109 119 129 139 149 

Scores 

Fig. 21. Scoies of 1000 College Students on a Symbol-Digit Code-I earn- 
ing lest iProm Anastasi, 3, p 34 ) 



44 54 64 74 84 94 104 114 124 134 144 154 164 174 

Stanford- Binet IQ 

Fig. 22. Stanford-Binet IQ’s of 2904 Unselcctcd Children between the 
Ages of 2 and 18. (From Terman and Merrill, 24, p. 37 ) 
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well as adults, are illustrated in'4he next two graphs. Figure 23 shows 
the percentage distribution of 5952 sixth grade school children on the 
Advanced Otis Examination, a widely used group test of general 
intelligence. 

The distribution of the AGCT scores obtained by 9,339,289 men 
during World War 11 is reproduced in Figure 24. This distribution 
exhibits two noteworthy deviations from the general form of the nor- 
mal curve (23). The most conspicuous deviation is the sudden piling 
up of individuals as the standard score of 40 is approached, a score 
which is close to the actual zero point of the test. The AGCT is 
unsuitable for measuring the abilities of persons having less than the 
equnalent of a fourth grade education. Consequently, when this test 

is administered to an unsclccted 
sampling, a piling up at or near a 
raw score of zero will occur. 
Many of the individuals who fell 
into this category were illiterate, 
and for most of them the zero 
score merely indicated that they 
should be reexamined with a non- 
verbal test. The broken line in 
Figure 24 show's the extrapolated 
end of the distribution which 
would probably have been ob- 
tained if the test had had a much 
lower zero point. A further char- 
acteristic to be noted in the curve is the small bulge between the scores 
of 60 and 80. One explanation which In.s been olTcrcd for this bulge 
is that a considerable proportion of the population have little interest 
in continuing such academic activities as reading and arithmetic after 
leaving school and they therefore allow these skills to retrogress (23 ). 
These individuals w'ould thus make a poorer showing on such a test 
than they would have made earlier. 

Of special interest arc a few' intelligence test surveys conducted on 
complete or nearly complete populations of children. In one of these 
(cf. 19, 20), the population chosen consisted of all children born 
between September 1, 1921 and August 31, 1925 and living within 
the boundaries of the city of Bath, England, on July 27, 1934. The 
Advanced Otis Examination (Form A) w^as given to all except the 



23. Pereeniage Distribution of 
5952 Sixth Ciradc School Children 
on the Otis A^dvanced Examination. 
(From ihorndikc ct al., 26, p. 5'^^' ) 
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defective children, who were subsequently tested with the 1916 revi- 
sion of the Stanford-Binet. All children falling below a certain score 
on the Otis were also retested with the Stanford-Binet (cf. 20). An 
unusually close approximation to the desired population was achieved, 
the number actually tested being 3361 out of a total of 3398 cases 
which fell within the specifications given above. The distribution of 
intelligence for the group as a whole did not deviate significantly from 
the normal curve. The tit was good for all portions of the obtained 



AGCT Scores 


Fig. 24. Scores of 9V\ Million Men on the Army General Classification 
'lest. (From Personnel Research Section, A.G.O., 23, p. 415.) 

distribution with the exception of the lowest 1 % of the group, whose 
lO’s were 63.4 or below. The number of children with lower IQ’s 
w'as in excess of the expected proportion, although the deviation was 
not significant until about the level of IQ 45. Below this level the 
excess was marked.'^ 

Probably the most ambitious sampling project undertaken to date 
was the testing of all children who had been born in Scotland in 1921, 
with the exception of the blind and the deaf (22). A specially de- 

Cf. the citation and discussion of these same findinus on p. 74. 
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signed 45-minute group test consisting of two pages of pictorial and 
five pages of verbal items was employed, together with a preliminary 
10-minute practice test. All testing was done on June 1, 1932, the 
children therefore ranging in age from lOVi to IIV^ at the time of 
testing. A total of 87,498 children were tested, a sampling which the 
authors describe as complete except for a negligible number of chil- 
dren in certain private schools and a few who were absent through 
sickness or other causes. It will be noted, moreover, that the testing 
occurred at a time of the year when absences are at a minimum. Sepa- 
rate scores are reported on the verbal and pictorial parts df the test. 
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Test Scores 

Fig. 25. Distribution of the Scores 
of 87,498 Scottish Children on a 
Verbal Ciroiip Test of Intelligence. 
(Data from Scottish Council lor Re- 
search in Education, 22, p. til.) 

Jhc last class-interval dcics not 
cover 10 pinnts, since the maximum 
score on the test was 76. 


The percentage distribution of 
the verbal scores is given in Fig- 
ure 25. Although on the whole 
this distribution shows a single 
clustering of scores at the center 
and a progressive decrease in fre- 
quency as the extremes are ap- 
proached, a number of irregulari- 
ties can be noted. Inequality of 
test units and inadequate cover- 
age at the low end of the scale 
are strongly suggested by a con- 
sideration of the test itself. The 
fact that 7.2% of the cases fell in 
the class-interval 0-9 further indi- 
cates that the zero point of the 
test was probably set too high for 
tlu present population. With the 
inclusion of more easy items, these 
cases would very likely have dis- 
tributed themselves over several 
class-intervals, below the present 
zero of the test. The distribution 


of the pictorial scores revealed the oppv •►ite effect, the test evidently 
being fairly well suited for the low-grade cases but too easy for the 
majority of the children. This distribution was highly skewed, with a 
marked piling up of cases at the upper end. 

Except in two aieas wlicie local circumstances necessitated testing on June 2 
and 3, respectively. 
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What is undoubtedly the most nearly complete testing of an entire 
population is to be found in the second Scottish survey (17, 25), 
reported in 1939. The population chosen for this survey was consid- 
erably smaller, including only children born in Scotland on any one 
of four specified days in 1926 (Feb. 1, May 1, Aug. 1, and Nov. J ). 
A diligent and painstaking search to the remotest corners of Scotland 
finally yielded a complete sampling, with the loss of only one case. 
The group included a total of 443 boys and 430 girls, ranging in age 
from 8 11 to 11-9 at the end of the testing. Ali children were given 
the 1916 revision of the Stanford-Binct together with eight per- 
formance tests selected from the Pmtner-Paterson Perlormancc 
Scale and other available series of a similar nature. 

The distribution of Stanford- 
Binet TQ’s for the total group of 
873 children is reproduced in Pig- 
iire 26. It will be seen that, al- 
though again exhibiting the gen- 
eral form of the nornicd curve, this 
distribution deviates from the the- 
oretieal normal curve in a number 
ol spccilic ways. 1 he distributions 
of scores on the various perlorm- 
ancc tests al'NO showed a number 
of irregularities. These results are 
not sui pricing when we consider 
that the Stanford-Binet as well as 
dien. (t loiu Macraceken, 1 7, p. 18.) most of the pcik^rmance tests used 

in this survey had been standard- 
ized on American children. It is highly probable that when such tests 
were applied to a population ol Scottish children, the leldtivc dilliculty 
ol units and the significance of raw scores w^ere appreciably alteied. 
These changes would in turn atlect the shape of the distribution curves. 

In the measurement of pa sonclity and chauu (a , testing techniques 
are still in a relatively ciude and undeveloped stage. Many sources of 
error remain, so that one should scarcely expect to find perfect speci- 
mens of the normal distribution curve. Despite a more jagged appear- 
ance and many minor irregularities, however, the available disiribu- 
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Fig. 2(i. Staiifoid-Binct 10'^ of a 
(\>nipielc Samplini^ot Scollisli ( h' 


The 1^'^7 ic\’son h.ul not >ct appealed v\hen the Icstinq ^^as bepi.n. 



Nature and Extent of Individual Differences 


85 



Extroversion Scores Introversion 


Fig. 27. Distribution ol Inlrovcrsion-Fxtroversion Scores of 200 College 
Students. (Data trom Heidbrcder, 12, p. 124.) 



Scores 

Fig. 28. Distribution of 600 College Women on the Ailport Ascendance- 
Submission Test. (From Rugglcs and Allport, 21, p. 520.) 
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tion curves exhibit quite generally the fundamental characteristics of 
the normal curve. Inspection of Figures 27 to 31 will make this 
apparent. 

Figure 27 gives the distribution of total introversion-cxtroversion 
scores on a self-rating questionnaire administered to 200 cbllege stu- 
dents. The positive scores correspond to the introvert end of the 
scale, the negative scores to the extrovert end. Tt will be readily seen 
that individuals do not cluster at opposite ends of the scale, as a clear- 
cut division into introverts and extroverts would imply. The greatest 
clustering occurs in the center, with a gradual dropping off as the 
extremes are approached.^** Figure 28, showing the distribution of 
600 college women on a test of ascendance-submission, closely ap- 
proximates the normal curve. 

Figures 29 to 31 are plotted with data taken from the studies of 
May and Hartshorne (9, 10, 11) on the measurement of character in 

school children. Figure 29 gives 
the distribution of ‘"cheating ra- 
tios” lor 2443 children. The 
cheating ratio indicates the yum- 
ber of times each child cheated 
relative to the number of oppor- 
tunities offered. The obtained 
curve does not admit of a clear- 
cut division of the group into the 
“honest” and the “dishonest” or 
into those who cheat and those 
who do not cheat. A slight skew- 
ness is exhibited, with a tendency 
for scores to pile up at the ‘"hon- 
est” end, but this may be caused 
by a limitation in the scale. 
The tests probably presented an insufficient number of situations in 
which cheating was made very easy or in which it involved a relatively 
minor “moral issue.” This would cut the scale short at the lower end 
and produce an excess of zero or very low cheating scores. 

Figure 30, giving the distribution of combined scores on several 

^^The slji;ht bimodaJity near the center of ihc scale might result fiom larger 
“gaps” between the items designed to sample intjrmcdiate degrees of this behavior, 
as discussed in an earlier section. 



Cheating Ratio 

Fig. 29. Distribution of "‘Cheating 
Ratios” of 244^ School Chiklicn. 
(Data fiom Hartshorne and May, 9, 

p. 220.) 
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Fig. 30. Distribution ot Persistence Scores among 656 School Children. 
(Llnpubl. data (rom investigation of Hartshorne, May, and Mailer, 10.) 
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Fig. 31. Distribution of Cooperativeness among 801 School Children. 
(Unpubl. data from investigation of Hartshorne, May, and Mailer, 10.) 
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tests oi persistence, likewise exhibits the general characteristics of a 
single central peak and approximate bilateral symmetry A particularly 
close resemblance to the normal curve is to be found m the distribu- 
tion of combined scores on several tests of service, or cooperativencss, 
presented in Figure 3 1 

THE NORMAL CURVE AS A METlIODOl OGTCAl PROBLEM 

As applied to psychological ehiraetcrislics, th ' normal distribution 
curve should be regarded moie as a methodological problem thin as 
a tactual observation Strictlv speaking it is impossible to cletLimine 
the actual distribution of i vaiiable unless an equal-unit scale of me is- 
urernent is employed The cflcets of inequality of units have been 
illustrated in an caiher section The only methods now available for 
obtaining equal units m a ps\ehologie il test are however, based up in 
the assumption that the behavior under consideration is itself nor- 
mally dutributed Thus to ask what is “the ’ actual distribution of 
behavior eonstiluUs at le ist foi the piesent a meaningless question 
A more signiiieant inquiry hovvever, concerns the specific eondi 
tions which determine the sh ipe of the distribution curve m any^ptii 
tie ular situation In tipproaehing this question we m i\ begin with the 
fact that the distribution most likely to result when a ehaiaeteristie 
eiepends upon a huge number of independent and equally wuphted 
induenecs is one resembling roughly the noimal curve 1 he reason- 
ableness of this cxpe^ctation for psveholomeal eh ir.ieteristies is sup- 
poited bv the known eomplevity of iheir determination It is also lele 
vant to note that nearly all phyn,.al tiaits, which are mcasuieel with 
cquil unit scales do follow the normal curve 

If then we begin wth the expectation^' that distribution curves 
will in gencMl lesemble the normal curve, anv deviation from nor- 
mality becomes a problem for mvcstig ition Such an approach to the 
form of the disTribiitu n should prove fiuitful in revealing the opera- 
tion of factors which merit studv in their own right For example a 
significant deviation from normality mav indicate that the test ceiling 
IS too low, or that its zero point is loo high, for the group being tested 
Similarly, some hitherto unsuspected selective factor operating in the 
sampling under investigation may now become apparent Finahy, the 
shape of the obtained distribution may furnish a clue to an important 

This IS not proposed as in assumption regaidinp the form of the distribution of 
abilities but is i piomising starting point in the, investigation of specific distributions 
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influence whose operation modifies the behavior itself in such a way 
as to alter the distribution curve. In other words, any significant devia- 
tion from normality should serve as a “signal” to alert the investigator 
to the need for further research. 

ft is certainly apparent that in the process of test construction tlic 
normal curve is now implicitly treated as a methodological concept, 
rather than as an empirically observed datum. Whenever a non-normal 
distribution is found in the standardizalicm group, the immediate 
response is to set to work revising the test. Most tests have thus been 
deliberately adjusted so as to yield a distribution which approximates 
the normal curvt^ in the population for which they were designed. 
Items arc dropped or added, tests are shifted up or down in the scale, 
scoring “weights” of dilTerent responses are altered, and other similar 
adjustments arc made until the desired approximation to normality is 
attained. To say, then, that a given distiibution is normal may simply 
mean that the process of test standardization was meticulously exe- 
cuted. Conversely, to say that a given distribution is not normal may 
mean only that the construction of the test was crude, or that the test 
was applied to a group unlike the standardization population. 

Tlili MEASURt MHNT OF VARIABILITY 

One is templed to compare the distributions of different traits in 
the effort to discover the relative variability of such traits. Do individ- 
uals dilTer more in physical or in psychological traits? Arc they more 
alike in intellectual or in emo: >nal characteristics? These and many 
similar questions have been raised repeatedly and answers have occa- 
sionally been offered. It is probably .^orrcct to state as a general 
principle that individual differences will ^e larger in the more complex 
than in the simpler traits. Any characteristic which depends upon the 
simultaneous variation of a large number of factors will exhibit more 
marked differences than one which is determined by relatively few 
factors. An illustration from coin tossing will again prove serviceable. 

To argue that psychologists ha\e been “hi in favor of the normal curve 

and that non-noinial di^u ibntioiis ought to be accepted (cf., c.g., 7) is just as mean- 
ingless as the insistence that the normal distribution of behavior has been empirically 
established. With the existing procedures of test construction, non-normal distribu- 
tions are no more independent ot the measuring instrument than are normal dis- 
li ibiitions. 

’■'Cf., for example, the interesting although rather futile discussions by W'cchsler 
(31) and by Ellis (5). The formet treatment fails to come to grips with the funda- 
mental difficulty piesented by arbitrary test zeros, while the latter is vitiated by several 
instances of faulty reasoning and factual error 
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If two coins are employed, the number of possible combinations which 
may result is only four; if, however, the number of coins is increased 
to ten, the possible variations, or patterns of head-and-tail combina- 
tions, total 1024. A complex trait is one which is determined by a 
large number of factors or conditions, and hence it will be expected to 
exhibit a greater range of variation. 

Apart from this rather obvious generalization, little can be said 
about the extent of individual variation in different traits. Upon close 
analysis, in fact, the question of the extent of vaiiability itself appears 
to be ambiguous and quite meaningless. The first problem which con- 
fronts one when trying to compare human variability in separate traits 
is that of the measuring rod employed for the different traits, or the 
units in which the measurements are reported. That the particular 
scale employed affects the amount of variability found is easily dem- 
onstrated. If the height of buildings in one city is measured in feet and 
in another city in yards, the buildings in the former city will seem to 
vary among themselves three limes as much as in the latter, even 
though the actual range in height may be identical in the two cities. 
Fortunately, feet can be translated into yards and vice versa. Bi^t this 
cannot be done with the units of psychological tests. The number of 
problems correctly solved on an arithmetic test cannot be transmuted 
into the same kind of units as words in an analogies test. The only 
solution offered for this difficulty is the use of measures of relative 
variability. 

All indices of relative variability are ratios. One such measure, the 
coefficient of variation, is found by dividing the standard deviation 
(cf. Ch. 2) by the average of the distribution. Thus variability is ex- 
pressed in relation to the average, the difference in units from one test 
to the other being automatically ruled out. For the same purpose, the 
ratio between the highest and lowest scores or the tenth highest and 
tenth lowest, or any other similar combination, is occasionally com- 
puted. Although in current use, all such measures are open to serious 
criticism. The difficulty arises from the fact that psychological scales 
do not measure the individual from a true or ^'absolute zero'' of 
ability as a base. A zero score on the National Intelligence Test, for 
example, does not mean zero intelligence. This test begins at an arbi- 
trary level corresponding to the ability of an average third grade school 
child. Consequently, anyone who fails to reach this level will receive a 
zero score on the test. If such an individual is given a test with a lower 
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“zero point,” such as a first grade or preschool test, for example, his 
score will no longer be zero. A zero score on a psychological test is 
thus an arbitrary zero, which varies from test to test. 

The custom of measuring from “absolute zero'’ in our physical scales 
is So general that it is difficult to conceive of the effects of using a 
scale that begins at an arbitrary zero point. Let us imagine a measur- 
ing stick on which height is measured, not from absolute zero or no 
height at all, but from some arbitrary point such as two feet. The fol- 
lowing diagram illustrates tlic situation. Any object two feet or less in 

I 1 1 1 1 1 \ 1 

0 1 2 3 4 5 

4 -FOOT BOY 6 FOOT MAN 

height would register zero on this scale. If such a scale were to be 
employed only to mcasutc the heights of individuals over five years of 
age, the arbitrary limit would perhaps not appear so absurd, since no 
one would be under two feet tall. This is in fact what has occurred in 
the construction of psychological tests. Since the AGCl , for example, 
was designed lor adult men, it would have been wasteful, and from a 
practical standpoint impossible, to extend it down to the intellectual 
level ol a newborn child. 

To return to oui yardstick with an arbitrary zero point at two feet, 
let us suppose that it has been used to measure the heights of a six- 
foot man and a four-foot boy The man will measure four feet and 
the boy two feel, as has been indicated on the diagiam. For many pur- 
poses, no error has been int' duced in the data by the use of the 
artificial zero point. On any scale, the man is two feet taller than the 
boy. If, however, we express their res’'»ectivc heights as a ratio, we 
reach the conclusion that the man is twi<'c as tall as the boy ( ) . This 
is not true of their actual Iieights from absolute zero, the man being 
only ]V2 times as tall as the boy The subtraction of a constant, 
two feet, from both heights has distorted the ratio. 

Such is the effect of an arbitrary zero point on any value which 
involves the division of one measure by another. For this reason, ratio 
or other relative measures cannot be employed in comparisons among 
the large majority of psychological tests, which arc not scaled from 
absolute zero.^^ Such measures would hold true only for the specific 

The only important exception to date js the CAVD IntclliKcnce Examination, 
prepared by the Institute of fcUicational Research at Icaclurs College, Columbia 
University (cf. Thorndike, et al 26). 
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tests in the form in which they were employed; the addition or re- 
moval of a few easy items at the lower end of the scale would com- 
pletely alter the relative variabilities. Obviously the values thus com- 
puted could not be regarded as very meaningful. We thus arrive at the 
conclusion that with available psychological tests it is impossible to 
compare variability from one tiaif to another. 

Other difficulties also appear as the problem is inspected more 
closely. Docs the question of the extent of variability refer to the 
whole human race? Which individuals, if any, shtxll be omitted in order 
to arrive at an estimate of human variability? Shall those who are 
regarded as definitely pathological and represent extreme deviations 
be excluded? If so, where should the line be drawn between a typical 
human group and an abnormal deviant? It seems reasonable, for ex- 
ample, to exclude from an estimate of the range of human variation 
in speed of movement one who has suffered an injury which renders his 
movements slow and halting. It is but a short step Irom this procedure 
to that which \\ould exclude those incapacitated through dhease. How, 
then, would tins criterion operate ii\the case of a feebleminded person 
in whom no physical defect can be discoxcred? How far sha^l this 
process of eliminating extreme cases be carried"^ 

A further ciucslion relates to the factors which arc to be held con- 
stant in measuimg the \ariabillty of any one irait. How homogeneous 
should the group bc‘^ The inclusion of children of different age levels 
would certainly increase the extent of variation in most traits. If only 
the range of individual differences x\ithin a fairly homogeneous popu- 
lation is desired, the difficulty of defining the leqiiircd degree of 
homogeneity is encountered. Many traits arc inlluenced by the social 
and economic level in which the individual ffnds himself. vShouid con- 
ditions of this sort also be held constant? Should difference^ in speed 
of performance be ruled out w'hen determining variability in “intelli- 
gence''? Such questions could be raised ad iniinhmn unless an arbi- 
trary limit is set up and adhered to consistently for the purposes of 
some one particular investigation. 

Wc may conclude from this analysis that the question of the extent 
of individual differences in different traits cannot be answered unless 
put in very specific terms. The population must be defined in detail 
within each investigation and the nature of the trait measured must be 
made clear, especially by indicating which conditions are to be held 
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constant and which will be allowed to vary. Obviously all hereditary 
and environmental conditions which alTcct a given trait cannot be held 
constant; otherwise variation would disappear. It should be added that 
at the present stage in the development of mental testing, owing to the 
use of incomparable units and arbitrary zero points, the question can- 
not be answered at all, in any form. 

INDTVTUUAL IMI T ERENC I S IN INFRAHUTvlAN GROUPS 

Individual differences are not to be regarded as characteristically 
human. Variation is a universal phenomenon throughout the organic 
scale, “All cals look gray at night,” but upon closer inspection each be- 
comes an iiidividual in his own right. Cursory or inadequate observa- 
tion often creates an impression of similarity or even identity among 
members of a group, while the differences pass unnoticed. For this 
reason, only the extreme deviants among animals have attracted atten- 
tion in the past, all other members of the species having been implicitly 
relegated to tlic limbo of “normality.” 

Several cases of exceptionally “gifted” animals^" have been de- 
scribed by their trainers or by observers, the remarkable feats of the 
animals having aroused the wonder and admiration of spectators. 
Among the most famous examples is Clever Hans, a stallion pur- 
chased in 1900 by a Mr. Van Osten of Berlin and subsequently trained 
by him. The horse was first taught a conventional alphabet in which 
each letter was represented by a certain combination of taps with the 
forefoot. Digits were indicated oy tlie appropriate number of taps. By 
this system, the horse learned to “count” objects presented to him and 
also to perform all forms of simple ai thmetic operations. He could 
handle fractions, first changing them into decimals. He wns able to 
give the correct answer to such a problem as the following: “T have 
a number in mind; I subtract 9 and have 3 as a remainder; what is 
the number?” He seemed to read German readily, and if presented 
with a series of cards containing written words, lie v ould step up and 
point with his nose to any words require. ^ rf him. He answered simple 
questions put to him orally, tapping out each letter of the answer in 
his conventional alphabet. He could give the date of any day one 
might mention, would tell time to the minute, and was able to analyze 


For a fuller discussion, set Vvalson (30, Ch. IX) and Tinklepaugh (28). 
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a discordant clang, telling his observers which note should be changed. 

Most of these feats are not, to be sure, as remarkable as they appear 
at first glance. 1 hus, it was found that Clever Hans was unable to 
respond correctly to a problem if no one present knew the answer. 
Likewise, when the observers were concealed, the horse failed. The 
unusual achievements of Clever Hans and of many other performing 
animals result, not from an understanding of arithmetic or an ability 
to read, but from an exceptionally keen observation of slight cues given 
by the observers. The trainer, or other persons picsent, will make some 
slight gesture, such as lifting the head a few millimeters, as soon as the 
animal has tapped the correct number of times. Such cues, it may be 
added, are usually given unintentionally and unconsciously. They may 
be too slight to attract the attention of spectators, but an observant 
animal will learn to respond to them. Although destroying some of the 
glamour which such feats have had for the public, this explanation does 
not imply that the task of learning to observe and respond to the 
proper cues is an easy one which any animal could accomplish. 

There remain, furthermore, the cases of animals who have been 
shown genuinely to respond to a wide variety of verbal comiUcgids in 
the absence of any other cues, or who have learned intricate combina- 
tions ot movements, or have in many other ways proved their ability 
to react to very complex situations. Performing dogs, such as “Fellow” 
w^ho could respond to approximately 400 words and execute the same 
commands even when worded diflerently, have been repeatedly ex- 
hibited. “Seeing Eye” dogs who lead the blind show a remarkably 
keen adjustment of their responses to the changing demands of the 
situation. Chimpanzees have been taught a wide variety of acts, such 
as skating, riding a bicycle, eating with knife and fork, unlocking 
doors. The performances of circus animals, and especially “musical” 
sea lions, arc well known. The observation of such animals, even when 
stripped of popular overstatement, still yields instances of marked 
individual differences. 

Nor is the evidence for individual vaiiation among infrahuman 
animals confined to the study of unusual cases. F very laboratory in- 
vestigation employing more than one subject has revealed individual 
differences.^® Animal psychologists have not as a rule been concerned 

for example, Ihc discussion ol this problem from \arjoiis angles by 
Tryon (29). 
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TABLE 3 Some Typical Data on Individual Differences in 
Injrahiiman Orgaimms 
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with the measurement of variability, so that the data on this problem 
arc usually mentioned only incidentally and frequently are not given 
in quantitative form. Whenever such data are reported, however, the 
range of performance in a randomly selected group is surprisingly 
large. Wide individual variation has been found in every phase of 
behavior investigated, such as the amount of gcneial spontaneous 
activity, the relative strength of drives, emotionality, speed of move- 
ment, quickness of learning simple tasks, and behavior in more com- 
plex pioblcm-solving situations. Some typical quantitative results on 
learning behavior have been brought together in Table 3. The average, 
range, and standard deviation for each set of data have been given 
whenever available. 

The first set of data is taken from experiments on conditioning. 
Two stimuli, such as a flash of light and an electiic shock to the foot, 
are presented together. After a numbei of combined repetitions of 
these stimuli, the withdrawal response becomes conditioned to the 
light, i.e., the animal will withdraw its foot upon appearance ol the 
light alone, without the piescnce ol the electric shock. It is customaiy 
in such an experiment to refer to the original stimulus (in this ca^c, the 
shock ) as the conditioning stimulus, and to the other as the conditioned 
stimulus. I he general natiiic of the conditioning and conditioned 
stimuli employed in each experiment has been indicated in Table 3, 
together with the type and number of animals investigated. Tt will be 
noted that the number of combined repetitions of the two stimuli re- 
quired to establish the conditioned leaction dilTers widclj from indi- 
vidual to individual within each group. 

Another set of data is furnished by a series of learning projects 
conducted at the Columbia University laboiatoiy of comparative psy- 
chology. Small samplings of guinea pigs, albino rats, common short- 
haired cats, and monkeys of two species were tested with the same 
type of ‘'pioblcm box,” in which a seiics of steps of increasing com- 
plexity was presented to the animal. The box consisted essentially of 
an outer and an inner cage, the latter containing the incentive which 
the animal obtained at the completion of each successful trial. In the 
outer cage were three plates to be depressed in a given order by the 
animal before the door to the incentive compartment was opened. In 
Table 3, only the number of trials required to learn step I are repro- 
duced, since this was the only step learned by all the groups. The 
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Quality of Performance or Racing Capaci^ Shown 

Fig. 32. A NorriKil Dislrihuliu.j Cuivc of Riicing C.ipacity, vShowing the 
I'icld ol 24 Horses “Nearing ihc Line” in ine Derby Stake.s al Lpsom 
Downs. (y\ftcr Laughlin, J6, p, 213.) 

problem in step I conslstCil simply in stepping on the first plate to the 
right as the animal entered the box.^^ The other steps involved stepping 
on plates J and 2; 1,2, and 3; 1, 2, 3, and hack to 2; 1, 2, 3, 2, ]; 
and so on to other combinations. 

Although these problem box studio's were conducted mainly to 
determine the highest number of step.'* which any animal within a 
given species could master, the data yield striking evidence of individ- 

Tn the study on cats, tlic problem set in step I was simpler, the animal being, 
allowed to step on anv one ol the three plates. The data on this group are therefore 
not diteclly comparable to those cii the other species. 
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ual differences within each species. Not only the number of trials 
required to learn each step, but also the number of steps which could 
be learned, differed from individual to individual. In the group of 
guinea pigs, some were unable to learn even step 1, while others suc- 
ceeded; among the rats, some learned two steps, some one, arid a 
few none; among the cats, the range is from 3 to 7 steps; among the 
rhesus monkeys 2 to 22, and among the cebus 5 to 15. Thus the indi- 
vidual variation was so large that an individual could easily be found 
in a “higher” species who was unable to learn a.^ much as a given indi- 
vidual in a “lower” species. 

in the third section of Table 3 are presented some typical data on 
maze learning among albino rats. The individual differences are again 
marked, as is indicated by the standard deviations of the number of 
trials required to master the correct path in each maze. It is thus 
apparent that close observation and measurement of animal behavior 
reveal fully as much individual variability as the studies on human 
subjects. 

An interesting example of the normal distribution curve in a func- 
tional trait in animals is to be found in the photograph and accom- 
panying curve reproduced in Figure 32. T he photograph shows Tiorses 
on the race track just before the finish. The relative position of the 
horses furnishes a vivid demonstration of the normal distribution of 
racing performance. A few are in the lead, an equally small number 
lag behind, and the majority are scattered in intermediate positions. 
The graph is a frequency curve of the “racing capacity” of the same 
horses, computed by a standardized formula. 
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Win IX) iNHiviDUM s nil n r from one another? What are the factors 
which produce variation? These questions l)a\e stimulated prolonged 
discussion and led to lively controversy. In addition to its fundamental 
theoretical importance, the problem of the causation of individual 
dilTeiences has tar-reaching practical significance in many fields. Any 
procedure involving the control of human development must be based 
upon an understanding ol the factors which inlluence such develop- 
ment. All educational methods make some assumption regarding the 
causes of individual diflerences. Is the main function of education to 
pioduce certain desirable traits, or merely to ofler opportunities for 
the development ot the child’s ‘‘potentialities”*^ Volumes have been 
devoted to argumentative and fieqiicntly verbose analyses of this ques- 
lion. The empirical atcumulatioii oi facts on the causes of individual 
variation alone can furra>h a conclusive answer. 

1 he type of educational activities, voc\ ions, and other pursuits tra- 
ditionally allotted to men and women rests upon certain beliefs regard- 
ing the cause of sex diderences in psychological traits. Relationships 
among racial and national groups, as well as attitudes toward various 
groups, arc based upon theories — either implicitlv assumed or cwertly 
stated — regal ding the origin of racial and iiatumal characteristics. 
Any caste system implies a hereditary u rentiation of people. Al- 
though not formally presciibed, such systems still prevail widely, fre- 
quently operating in vocational choices and many other situations of 
everyday life. The interpretation of familv resemblances, and even in 
some cases the development of family groupings themselves, rests upon 
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specific underlying hypotheses regarding the causal factors in human 
resemblance and dissimilarity. 

THE NATURE OF HEREDITY ^ 

The basis of individual diflcrences is to be found in each individual’s 
hereditary background and in the environmental conditions under 
which he has developed. Let us first consider what, specifically, is 
meant by “heredity.” It need hardly be mentioned, of course, that as 
herein used the term ‘‘heredity” signifies biological heredity. It is only 
figuratively that we speak of “social heredity,” as in such expressions 
as “the cultural heritage of the twentieth century” or the “inheritance 
of the family fortune.” So-called social inheritance actually falls under 
the heading of environmental influences. 

Basically, an individual’s heredity consists of the specific genes 
which he receives from each parent at conception. To call a certain 
inllucnce, factor, or characteristic hereditary should thus mean that it 
can ultimately be traced to the presence of a particular gene or com- 
bination of genes. The genes are grouped into chromosomes, or “tol- 
ored bodies,” so named because they become visible within the cell 
nucleus when the cell is stained with certain dyes for observation. 
Chromosomes occur in pairs, the two members of each pair being 
similar in appearance and function. The number of chromosomes in 
each cell is, in general, constant within each species, but clifTers from 
one species to another. Each human cell, for example, contains 48 
chromosomes (24 pairs); in each cell of the mosquito, there are 6 
(3 pairs); and in each cell of a certain species of crayfish, there are 
200 (100 pairs). 

Chromosomes arc visible under a microscope, appearing as rod- 
like, sausage-shaped, or V-sluiped bodies (cf. Fig. 33). The genes 
within each chromosome, however, are so minute as to be invisible, 
even with a high-power microscope. Through the observation of giant 
chromosomes wdiich have been discovered within the salivary glands 
of certain species of flies, it has proved possible to examine the inter- 
nal structure of chromosomes more fully under the microscope. Al- 

^ To fia out the vciy brief sketch of the mechanism of heicdity which follow'' the 
reader is urged to consult any recent text on genetics, such as Sruiott and Dunn 
(24) or Sn>der (25). for a discussion ot the corcept of heredity, cf. Holt (13), 
Jennings (14), and Mullei, I ilile, and Snyder (I^^O. A very readable popularized 
account of heredity is offered by Schemfeid (22). 
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though in volume they are from 1000 to 2000 times larger, in other 
essential characteristics these giant chromosomes arc like those found 
in other body cells. Figure 34 shows a segment of a giant chromosome 
from the salivary glands of the fruit fly, Drosophila melanogaster. 
Even in such photographs, however, direct observation of the genes 
themselves is not possible. More recently, the development of the 
electron microscope, which produces a much higher degree of magnifi- 
cation, has oflfered new opportunities for investigating the internal 
structure of chromosomes and may 
ultimately permit a more direct study 
of tiie nature of genes (2). 

In the normal process of cell divi- 
sion, or mitosis, every chromosome 
is duplicated by splitting longitudi- 
nally along its entire length. Each 
cell resulting from this division re- 
ceives an identical set of chromo- 
somes. All cells in the body thus 
have identical heredity. That some 
develop into eye cells, others into 
skin, bone, or any of the other varie- 
ties of body cells depends upon the 
influence of the cellular environment. 

Such conditions as gravity, p’-essiirc, 
availability of oxygen and other 
chemicals, and electrical fields oper- 
ate dilTerentially upon individual cells, 
depending upon the position of the cell in relation to other cells.^ 
It is believed that the genes, which have been described as “minute 
packets of chemicals,’’ act as catalysts in these interactions between 
the cell and its environment. 

When the individual has attained sexual maturity, a different type 
of cell division occurs in the formation the specialized reproductive 
cells, the ova of the female and spermatozoa of the male. This process 
is known as reduction division, since the chromosomes in each repro- 

2 Technically, this means that “physiological giadients” of development are estab- 
lished, such as surfacc-inierioi, doiso-venlial, or antcro-posterior gradients (cf. Child, 
6, especially Ch. VI 11 and IX). 



Fig. 33. Human Chromo- 
somes as Seen Under a Micro- 
scope. (From Evans and 
Swezy, 8.) 
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ductive cell are reduced to one-half the original number. Instead of 
duplicating, as they do in mitosis, the two chromosomes in each pair 
separate, one going to each daughter cell. It should be noted that in 
this type of cell division each cell may receive a different combination 
of chromosomes, since the chromosomes in each pair assort at random. 
Moreover, the chromosomes arc not always segregated as units into 

the different daughter cells, but 
’’ segments of one chromosome 

may combine with segments of 
f another (“crossing over”), thus 

I S increasing the variety of possible 

: H combinations of genes in the indi- 

M vidnal daughter cells. When the 
i ^ S ovum of the mother unites with 

; ^ * 1 #^ flici spermatozoon c'f the father in 

^ the process of fertilization, the 

! ^ full number of chromosomes is 

a restored and remains through 

subsequent mitosis of the develop- 
f ing offspring. 

^ The hereditary basis for indi- 

^ vidual dilferences is furnished by 

the almost unlimited variety of 
^ possible gene combinations which 

may occur, especially in such a 

Fig. 34. Giant Chromosome from complex organism as man. It 

the vSaliv.'iry Chand of the Fruit Fly. should be noted, first, that even 

(From Painter, 21, p. 464,) simple human characteristics gen- 

erally depend upon the combined 
influence of large numbers of genes. Secondly, the individual germ 
cells of each parent organism contain diffcient combinations of 
genes, as a result of the process of reduction division. Thirdly, 
the cells of two organisms, the mother and the father, combine to 
produce the new organism, thereby further increasing the variety of 
possible gene combinations. It should thus be apparent that no two 
siblings (i.e., brothers or sisters) will have identical heredity. The 
same is true of fraternal twins, who, although born at the same time, 
develop from separate germ cells and are no more alike in heredity 
than ordinary siblings. Fraternal twins may be of the same or opposite 
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sex, and may be quite unlike in appearance. Identical twins, on the 
other hand, develop from the division of a single fertilized ovum and 
therefore have identical sets of genes. Such individuals are complete 
duplicates as far as heredity is concerned. 

The simplest illustration of the mechanism of heredity is furnished 
by unit factors, which depend upon a single pair of genes. An example 
of such a unit factor is albinism, or the absence of pigmentation in 
the eyes, hair, and skin. If the individual received a gene for albinism 
from each of his parents (cc), he will himself be an albino. Individuals 
with two genes for n(')rmal color (CC) will have normal pigmentation. 
Botii of these individuals are described as ln)nio:y<^oiis with respect to 
albinism. 1 his simply means that the fertilized ovum, or zygote, from 
which such individuals developed received like genes for albinism or 
for normal coloring from both parents. If an individual received the 
gene for albinism from one parent and the gene for normal coloring 
from the other parent (Cc), he is said to be iictcr(>zyyous in this char- 
acteristic. Such an individual will show normal coloring, since normal 
coloring is dominant and albinism is recessive. In other words, 
albinism, being a recessive factor, appears only when the individual 
has received the recessive gene for albinism from each parent. 1^hc 
heteiozygous individual (Cc), although himself normal in coloring, 
neverdieless carries the recessive gene for albinism, which he may in 
turn transmit to his offspring. 

In the case of other unit fa^dors, the heterozygous individuals may 
exhibit blending rather than dominance. For example, in poultry, black 
and splashed- white coloring arc a corresponding pair of unit factors, 
but neither is dominant. A cross-breed v f these two varieties of poultry 
will produce individuals of a third color, known as “Blue Andalu- 
sians,” unlike either of the two parents. 

The sex of an individual is itself determined by a pair of chromo- 
somes, known as the sex chromosomes, and designated X and Y. If 
the child receives an X chromosome from each parent, it wall be a 
female; if one X and one Y chromosi :ae arc received, a male will 
result. From its mother, the child can receive only X chromosomes; 
while the father can pass on either an X or a Y chromosome. The 
Y chromosome is relatively small and is believed to contain very few 
genes. Sex differences in a number of other characteristics may occur 
because of specific genes carried by the X chromosome. Several such 
sex-linked characteristics have been identified, among the best-known 
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examples being color-blindness and hemophilia/'^ Both of these condi- 
tions depend upon a recessive gene carried in the X chromosome. If a 
daughter inherits this factor from one parent only, the dominant nor- 
mal gene in the other X chromosome will prevent the appearance of 
the defect. Thus a girl will show the defect only if she inherits the 
defective gene from both parents. In the case of a boy who receives 
an X chromosome with the defective gene, on the other hand, the 
defect will invariably appear, since there is no corresponding normal 
gene in the Y chromosome. Consequently, such characteristics are 
more common among males than among females. 

Certain other factors, such as baldness, are sex-influenced, i.e., 
they behave as dominants in one sex and as reccssives in the other. 
Thus baldness will develop in a male if the gene for baldness was 
transmitted by cither parent. In the female, it will develop only if 
genes for baldness were received from both parents. Still other factors, 
known as sex-limited, are present in both sexes, but their expression 
is inhibited in one sex by the presence of the sex hormones. Many of 
the physical differences between the sexes arc probably based upt>n 
this type of factor. Destruction or improper functioning of the endo- 
crine sex glands can thus bring about changes in the development of 
these characteristics. 

It should be noted that whenever a characteristic depends upon a 
single pair of unit factors, the result will be distinctly identifiable types 
which differ qualitatively from each other. Most trails, however, de- 
pend upon multiple factors, the number of resulting combinations 
increasing rapidly as the number of contributing factors increases. 
With even a relatively small number of contributing factors, the re- 
sulting individual differences are quantitative and their distribution 
may approximate the normal curve. Body height is an illustration of 
such a multiple-factor characteristic in the human. 

In the case of certain multiple-factor characters, the appearance or 
non-appearance of the character itself depends upon a unit factor. 
In other words, the operation of the multiple factors is itself depend- 
ent upon the presence of a specific gene, which may thus be regarded 
as a limiting condition. The illustration of albinism may again serve 
in this connection. It is now known that the determination of human 

3 A condition in which the blood fails to clot and the individual may theiefore 
bleed to death even from a slight scratch. This condition attracted especial notice 
because of its occurrence in ceilain loyal families of Europe. 
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eye color depends upon the presence of several pairs of genes. Differ- 
ent combinations of such genes produce the almost continuous grada- 
tions of observable eye color. If, however, an individual has received 
the unit factor for albinism from poth parents (cc), he will be an 
albino regardless of what combination of eye-color genes he may have. 
The latter are rendered inoperative in the determination of his eye 
color by the presence of the pair of genes for albinism. Similarly, the 
spotted coat found in certain breeds of cattle results from a single 
recessive factor. But the degree of spotting varies along a virtually 
continuous scale and depends upon a number of modifying multiple 
factors. This type of relationship is especially relevant to the possible 
role of heredity in the development of some psychological character- 
istics. We shall, in fact, have occasion to refer to it again in our discus- 
sion of certain types of feeblemindedness (cf. Ch. 16). 

Finally, mention should be made of the concept of genic balance.*' 
For purposes of analysis, the biologist must necessarily study the influ- 
ence of particular genes upon the development of each characteristic. 
We must remember, however, that every characteristic actually results 
from the interaction of all the genes which the individual has inherited. 
Snyder (25, p. 232) summarizes the contemporary viewpoint of 
geneticists on this point as follows: 

A gene always exerts its effect in the presence of other genes; hence has 
arisen the idea of genic balance, by which is meant that any character is 
the result ol the entire gene conij ’ex acting in a given environment. Varia- 
tions in a character may be produced by variations in a single gene, but 
always in the presence of the rest of the g*^nes. 

THE NATURE OF INVIRONMENT 

The concept of environment also requires some clarification. The pop- 
ular definition of environment is a geographical or residential one. A 
child is said to have a ‘‘poor environment,” for example, because he 
lives in the slums. Or his “environment” (s characterized as a French 
village, an American small town, or a Welsh mining community. Psy- 
chologically, such descriptions of environment are highly inadequate. 
It cannot be concluded, for example, that an 8-year-old boy and his 
5-year-old brother standing in the same room at the same time have 
identical psychological environments even at that moment. The very 
fact that the current environment of the former includes the presence 
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of a younger sibling and that of the latter the presence of an older 
sibling constitutes a significant psychological difTcrcncc. Moreover, th: 
differing backgrounds of past experience of the two siblings will m 
turn cause a difference in what each gets out of the present situation. 
One point is obvious from this illustration: the fact that two children 
have been brought up in the same home is no indication that they have 
had identical psychological environments. 

Psychologically, environment consists of the sum total of the stimu- 
lation which the individual receives from conception until death. This 
is an active concept of environment, i.c., the physical presence of 
objects does not in itself constitute environment unless the objects 
serve as stimuli for the individual (cf. 15, 29). This definition is also 
more inclusive than the popular one, covering all forms of stimulation 
and extending over the entire life cycle. 

The importance of the prenatal environment in determining the 
individuafs development has been fully demonstrated. Variations in 
diet and nutrition, glandular secretions, and other physical conditions 
of the mother, for example, may exert a profound and lastijig influence 
upon the development of the embryo. T hat the structural development 
of the organism is definitely inlluenced by early environmental factors 
is clearly indicated by a number of experimentally induced alterations 
in lower animals. 

A curious transformation can be environmentally produced in the 
axolotl, a large salamander (cf. 14, pp. 117, 124-125). Normally, 
this animal has prominent external gills, a large tall adapted for swim- 
ming, and other characteristics suited for aquatic life. If the young 
axolotl is fed on thyroid, it loses its gills and its body becomes gen- 
erally altered so that it is no longer adapted to swimming. The animal 
then becomes a land salamander, known as Amblystoma, and returns 
to the Ne ater only to lay its eggs. 

In the fruit fly, a defective gene causes the animal to produce “re- 
duplicated legs," i.e., certain joints of the legs, or entire legs, are dou- 
bled. Although the inheritance of this defective gene has been definitely 
traced, this characteristic will not appear under certain environmental 
conditions (12). When animals known to have the defective gene are 
kept at a sufficiently warm temperature, the additional leg or joint will 
not develop. Successive generations bred under these conditions will 
have a normal appearance. If, however, any of their offspring are 
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allowed lo develop in colder temperatures, the defect will reappear. 
This furnishes a definite illustration of the fact that even a clearly 
demonstrable “inherited defect” is actually only a tendency to develop 
in a given way under certain environmental conditions. 

Experimentally produced “monsters” represent conspicuous exam- 
ples of the influence of prenatal environment (26, Ch. VI and VII). 
In experiments on fish eggs, “siamesc-twin” fish have been produced 
by artificially inhibiting or slowing down the rate of development at 
an early age through low temperature, insullicicnt oxygen, or ultra- 
violet rays. In some cases, one twin is much smaller than the other 
and i‘ deformed, the larger twin being a perfectly normal fish. Two- 
headed monsters have been produced among tadpoles and several 
species of fish by the application of various chemical or mechanical 
stimuli. 

Fundamental variations in the number and position of the eyes of 
minnows have likewise been artificially induced. If the eggs of the 
minnow arc alk)\vcd to develop in sea water to which has been added 
an excess of magnesium chloride, peculiar eye conditions will appear 
in a large majority of the embryos. Instead of the usual two eyes, many 
will develop a centrally placed “cyclopean” eye, so named after the 
one-eyed Cyclops of mythology. Others may show a single lateral eye, 
placed to the right or left of the head. Or the two eyes may be abnor- 
mally close together. Some of these artificially produced monsters are 
shown in Figure 35. 

Other physical or chemical agents may be employed to produce the 
same anomalies of development. 1 he primary determining factor in 
the development of a particular abnorni.dity seems to be the stage at 
which the agent is introduced, rather than the nature of the specific 
agent employed. The essential effect is a change in the rate of develop- 
ment, which alters the balance of growth among the different parts of 
the organism. In commenting Upon these experiments, Stockard writes 
(26, pp. 109-110): 

In other words, ihu genetic composition c.1 these fishes causes them to 
develop two eyes in normal sea-water, but the same genetic composition 
gives rise to a single cyclopcan eye when an excess of magnesium chloride 
is added to the sea-water. If sea-water normally had the composition 
which causes fish to develop \\ith the cyclopcan eye, and an experimenter 
should develop the eggs of fish in a solution of the same composition as 
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our ordinary sea-water, he would find them giving rise to fish with two 
lateral eyes instead of the median one, and these two-eyed specimens 
would appear to this imaginary investigator as monsters. 

Thus we cannot even speak of certain structural characteristics as 
being “normal” for a given species and fixed by hereditary constitution. 
If the environment in which the organisms develop were to undergo a 
change of a more or less permanent nature, a different set of charac- 
teristics would come to be considered normal. Similarities of develop- 



Fig. 35. Onc-Eyed “Cyclopean” Minnows Resulting Irom Environmental 
Conditions. (Fiom Stockaid, 26, p. 109.) 


ment are attributable to common exposure to an essentially similar 
environment as much as to the possession of common genes. 

Observations on various species have also demonstrated consider- 
able behavior development during prenatal life, as well as the influ- 
ence of specific conditions of the prenatal environment upon such 
development (cf. 5). The “zero-point” of behavior falls wefi before 
birth, the “behavior age,” or “mental age” at birth varying widely 
from species to species (5). Stages of motor development have been 
clearly established in the embryos of many animals. Sensitivity to 
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various types of stimuli has also been noted early in prenatal life. 
Hence the fact that various functions may have been exercised before 
birth cannot be ignored in the study of subsequent behavior develop- 
ment. The possibility of conditioning to changes in temperature, pres- 
sure, and other stimuli in the prenatal environment must likewise be 
taken into account. Investigation of prenatal learning opens an interest- 
ing field of research into the origins of behavior. 

Finally, it should be noted that, with increasing precision of defini- 
tion, the concept of environment has gradually broadened, and that it 
has also become less sharply distinguishable from the concept of 
heredity. The popular identification of environment with “external” 
and heredity with “internal” influences has had to be discarded in the 
light of increasing knowledge of the operation of heredity and environ- 
ment. In the preceding section, reference has already been made to 
inter-celliilar environment, i.e., the environment consisting of sur- 
rounding bodily cells, in which each individual cell develops. The 
important role of this cellular environment in the establishment of 
gradients and in other developmental processes is now recognized. 

Carrying the analysis still further, we should also consider the intra- 
cellular environment. It is obvious that the genes exert their influence 
in an environment consisting of the cytoplasm of the cell. The role of 
the intra-cell ular environment is especially important after some dif- 
ferentiation has occurred in the process of cell division. Cells which 
contain identical genes but different cytoplasmic structure will differ 
in their ultimate development. 1 ne original differentiation occurs under 
the influence of the genes, but once it has taken place, it in turn affects 
the further action of the genes. ^ It should be added that each gene 
must also be regarded as operating in an envhonment of other genes 
within any one cell. This mutual interdependence of genes is what is 
meant by the concept of genic balance, discussed in an earlier section. 

From a slightly different angle, mention may be made of the fact 
that genes themselves, the essential element in any definition of hered- 
ity, are not completely immune to envhi u'nental influences. Experi- 
ments with various types of radiation, including X-iays, radium rays, 

^Geneticists have proposed thai the genes may opeiate as enzymes or catalysts, 
inducing chemical changes in the cyptoplasm without themselves becoming altered 
The enzymatic action of a particular gene may piodiice different results (or no 
result at all), depending upon the specific chemicals in the cytoplasm of a particular 
cell. This theory does not preclude the possibility that genes may also exert their 
influence in othei ways. 
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ultraviolet light, and heal rays, have demonstrated the susceptibility 
of genes to such influences. Similar effects have more recently been 
obtained with certain chemicals (7). Since the genes themselves are 
affected, the changes produced by these agents arc not only manifested 
in the immediate offspring, but are transmissible to future generations. 
In such a case, a hereditary variant (mutation) results from the oper- 
ation of an environmental factor. These experiments serve to demon- 
strate further the fineness of the line which separates the operation of 
heredity and environment, 

THH IIERT niTY-rNVIRONMrNT RI LATIONSIITP 

The early concept of “instinct.” still prevalent in much popular think- 
ing, implied the existence of behavior which is wholly hereditary. Ihe 
classification of behavior into “instincts" and “habits,” corresponding 
to “native behavior” and “acquired behavior,” respectively, assumed 
the excliisi\e operation of either heredity or environment within a 
given activity. Such a theory, implying the hereditary transmission of 
behavior functions in toto, has been quite generally superseded in con- 
temporary psychology. It is now rccogni/ed that every trait fc)f the 
individual and every reaction which he manifests depend both upon 
his heredity and upon his environment. Although commonly admitted 
to be untenable, tlie belief that psychological characteristics can be 
separated into those which are inherited and those which are acquired 
is implied in various lc»o‘-ely expressed gencrali/ations about the in- 
heritance of behavi(^r characteristics. Discussions regarding the “in- 
heritance” of intelligence, special talents, or insanity, for example, 
frequently leave the impression that the inheritance of the behavior 
itself was meant. Nor aic more recent and more sophisticated psycho- 
logical writings wholly free of such implications. Upon careful con- 
sideration, however, it is apparent that hereditary and environmental 
factors cannot be so glibly separated, nor can behavior be naively 
divided into that w^hich is inherited and that which is acquired. 

A second possible way in which the heredity-environment relation- 
ship may be conceived is in terms of additive contribution. According 
to this view, both heredity and environment contribute to all behavior 
development, and the resulting behavior characteristics can be ana- 

^ Much of Ihc present section is based upon a recent article by Anastasi and 
Foley (1) 
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lyzed into the sum of hereditary and environmental influences. That 
heredity and environment contribute jointly to the development of 
behavior is undoubtedly the most widely held view, but the additive 
assumption regarding their operation is rarely expressed as such. Just 
this assumption, however, underlies all attempts which have been 
made to determine the proportional contribution of heredity and en- 
vironment to the development of particular behavior characteristics.'^ 
A statement that “licrcdity contributes 75% and environment 25% 
to the devclopmenl of irilelligence/’ for example, would illustrate this 
additive approach. It might be noted that the same investigators who 
have offered such estimates of proportional contribution have occa- 
sionally argued against the additive view of heredity and environment, 
apparently unaware of the inconsistency in this procedure (cf., c.g., 
Burks, 3, 4). 

Jhc most widely expressed view of the heredity-environment jcla- 
lionship is that of inieniciion. This means primarily that the ellects of 
hereditary and environmental factors arc not cumulative or additive, 
but rather that the nature and extent of the influence of each type of 
factor depend upon the contribution of the other. In other words, any 
one environmental factor wall exert a differew influence depending 
upon the specific hereditary material upon which it operates. Similarly, 
any hereditary factor will operate diflerently under dillerent environ- 
mental conditions. It is apparent that any estimate of the proportional 
contribution of a hereditary or environmental factor is inconsistent 
with this viewpoint, since the proportion would vary as cither heredi- 
tary or environmental facton, varied. To the question, “What is the 
relative contribution of heredity and environment to individual ditfer- 
enccs in, let us say, IQ?" there would thus be an infinite number of 
possible answers. 

As an illustration of this point we may consider a non-psychological 
characteristic wdiose heredity is knowm. The number of facets in the 
eyes of the fruit fly, Drosoplulu, have been found to vary widely in a 
number of types which differ in their gene consdlution. The tempera- 
ture at which the larvae are kept also Je'ermines the actual number 
of eye-facets w^hich develop. The interaction of these two factors, 
hereditary and environmental, is illustrated in Figure 36. This graph 

® An extensiv'' analysis of the implications of Uic concept of “proportional con- 
tribution,” as applied to the heredity-environment problem, is to be found in 
Loevinger (18). Cf. also Schwesinger (23). 
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shows the effect of temperature upon the number of eye-facets in two 
types of individuals differing in genetic constitution, which for con- 
venience have been designated “genetic type A” and “genetic type B” 
on the graph. It will be noted that the form of the curve differs for the 
two genetic types. The difference in number of eye-facets between the 



Fig. 36. An Illustration of the Interaction of Hereditary and Environ- 
mental Factors: Number of Eye-Facets in Drosophila as a Function of 
Genetic Constitution and of Temperature. (From Hogben, 11, p. 96.) 


two genetic types was much greater at 16*^ than at 25^. Conversely, 
the effect of temperature was greater on one genetic type than on the 
other. Thus, a “different difference” resulted from environmental 
changes when operating on individuals of different heredity; and a 
“different difference” resulted from hereditary variations when operat- 
ing in different environments. The “ratio” of hereditary and environ- 
mental contributions would thus vary as either factor varied.^ 


For other illustrations, cf. Haldane (10) and Hogben (11). 
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The operation of a similar type of heredity-environment relationship 
can readily be recognized in the development of many familiar human 
characteristics. If we ask, for example, to what extent body weight 
depends upon such environmental factors as diet and exercise and to 
what extent it depends upon hereditary factors, no single answer can 
be given for all individuals or all environmental conditions. Because 
of differences in hereditary factors, the body weight of certain indi- 
viduals is more susceptible to differences in diet, exercise, etc., than 
that of other individuals. In the former type of person, the contribution 
of heredity is smaller. Thus, the proportional contribution of heredity 
and environment to body weight may itself be determined by heredi- 
tary factors, and may vary from person to person. For example, for 
some individuals diet and other environmental factors might contribute 
10% to the determination of body weight, for others 80%. The pro- 
portional contribution (>f heredity and environment may likewise be 
altered by variations on the environmental side, such as the absolute 
amount of food intake. Thus when the total amount of food intake is 
low. as in a ncar-starvation diet, body weight undoubtedly depends to 
a much greatei extent upon differences in the amount ol food. When 
the total intake of food is large, individual dilferences in body weight 
are probably much less dependent upon diet. 

finally, we may consider a hypothetical illustration involving intelli- 
gence test scores. Suppose we find a 10-point diflercncc in JO between 
two identical twins icared in separate foster homes (A and B), and a 
3()-point difference in IQ between two unrelated children, one reared 
in foster home A and the oth. in foster home B. Can wc argue that 
the 10-point difference bctw'cen the identical twins measures the 
“diffcientiating effect” of these two ho’oc environments, and that the 
30-point difference between the uniehted children can thtTcfore be 
analyzed into 10 points attributable to environment and 20 points 
attributable to heredity? Could we conelude that, in so far as these 
cases show, heredity is twice as important as environment in the pro- 
duction of individual diflcrcnccs in 10? If wc follow the concept of 
interaction, the answer to both questi /Ps is ‘"No.” Actually, a very 
slight hci editary difference between the two unrelated children may 
have greatly augmented the difference between the cflectivc environ- 
ments of the two foster homes. The difference in environmental stimu- 
lation between the two homes w^oiild thus have been much greater for 
the unrelated children than for the identical twins. No simple sub- 
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traction of the end-products could disentangle the relative contribution 
of the factors whose initial interaction led to the obtained difference 
in 10. 

All the examples of interaction which have been discussed — the 
eyc-faccts of the fruit f\y, as well as the hypothetical examples of body 
weight and IQ — illustrate the interdependence of heredity and environ- 
ment, w^hich is fundamental to the concept of interaction. To sum- 
marize, interdependence means that the contribution of any given en- 
vironmental factor to a particular trait depends upon the individuaTs 
specific hereditary background; and conversely, the contribution of any 
given hereditary factor depends upon the specific environmental con- 
ditions within which it operates. Another implication of the concept 
of interaction is that the heredity-environment relationship can be 
more accurately likened to the arithmetic operation of ninhipliration ^ 
than to that of addition. The individuaTs characteristics may be con- 
ceived as the product, rather than the sum, of the hereditary and 
environmental factors, kinder these conditions, a slight difference in 
environment, in combinalion with a slight dillerence in heredity, may 
iihimately lead to a very large difference in the resulting characteristic. 
We must envisage such a ‘‘multiplicatioif’ of influences as occurring 
successively in the individuaks development, each new' “product'’ 
being itself the basis for further multiplication in an ever-widening 
radius. Thus a slight initial difference between two individuals may 
launch them on xwo widely diverging j)aths of development. 

Still another implication of the concept of interaction should be 
recogni/ed. Any estimate of the relative contribution of hereditary and 
environmental factors to individual differences obviously depends upon 
the range or extent of both hereditary and environmental dilfcrcnces 
w ithin the population under consideration. F'or example, susceptibility 
to diphtheria has been showm to depend upon a recessive hereditary 
factor, and immunity upon a corresponding dominant factor (25. 
pp. 370 -371 ). This disease w'ill not be contracted, however, without 
infection by the diphtheria baci^us. If, now, we consider a population 
all of whom have inherited susceptibility, then indi' idual differences in 
the development of the disease could be attributed entirely to the 

^ To spciik of herctauiry :ind cnviionmcnf ,il factors as being inuUiplied is obvi- 
ously an oveisiniplilic'ation, al/bough hc'p.'i.il m visualizing the rclalionsh!,A involved. 
The actual mathematical function by which hereditary and environmental contribu- 
tions combine is unknown and may well differ from one specific characteristic to 
another. 
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environmental differences, i.e., exposure to infection. On the other 
hand, in a population in which all are equally exposed to the bacillus, 
any individual differences would be attributable to differences in 
heredity, i.e., whether the dominant gene for immunity was present. 
To the question, “What proportion of the variance in the development 
of diphtheria is attributable to heredity?'’ opposite answers would be 
reached in these two populations. Similarly, a wide variety of inter- 
mediate answers could be reached in other populations, depending 
upon the relative frequency of exposure and Ihc relative frequency of 
individuals with the dominant gene for diphtheria immunity in each 
population.^ 

Throughout this discussion, the terms “heredity" and “environment” 
have frequently been used without qualilicalions for the sake of 
brevity. It sliould not be concluded, of course, that they refer to single 
entities or forces. Both heredity and environment are general names 
for complex mcinijolds of many specific influences. In the development 
of the individual, interaction occurs nithin as well as between the 
specific factors in each of the two categories. To speak of all the thou- 
sands of genes, each with its specific chemical and other properties, 
as though they represented a single force, operating as a unit to stimu- 
late development in a particular direciion, is highly misleading. It is 
even more clearly apparent that ‘'environment” is not an entity which 
can be conlra>ted or juxtaposed with “heredity.” Cellular environ- 
ment, radiation effects upon genes, birth injuries, educational history, 
and socio-economic level can scarcely be treated as a single influence! 


POPULAR AI1S( ()N( rPTIONS RLGARr^lNG HLRLDITY 
AND PNVIRONMLNT 

A number of misconceptions regarding the operation of heredity and 
environment arc still prevalent in popular thought. Before proceeding 
farther, we shall examine briefly some of the most common of these 


Tlic three implications of the concept of ‘ jtci action discussed above arc inde- 
pendent ol each otlici althoiiph all thr(‘e are .aly implied in ciiricnt discussions 
of the hcredily-enviromnent lehitionship. It would be loj’ically possible, for example, 
for Ihc operation of heredity and envionnient to he additive, while (he first and 
third conditions discussed still hd J. in this case, estimates of proportumal contribu- 
tion V OLild still be meanmcless. Or the heredily-enviionment iclalionship might be 
one of multiplication, without interdependence, i.e., (he weight of the hereditary and 
envnonmcntvd factors would vais iridepcndciuly ol each other. 
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erroneous beliefs, in order to clear the way for further analysis of the 
heredity-em ironment problem. 

Hereditary versus Inborn. One of the most common sources of 
confusion in discussions of heredity and environment is that between 
“hereditary” and “inborn.” The popular belief that whatever is present 
at birth is necessarily inherited is bolstered by the lack of precision in 
terminology. The dictionary definitions of such terms as “hereditary,” 
“inborn,” “innate,” “congenital,” and “native” are difficult to differen- 
tiate. Certainly the terms are often used interchangeably, in the scien- 
tific as well as the popular literature. The scientist usually employs all or 
most of them as synonymous with “hereditary.” The layman, on the 
other hand, frequently interprets all these terms with reference to birth, 
a reference which is obviously present in the root of such words as 
“inborn,” “native,” and “innate.” 

It is, of couise, just as incorrect to regard the influence of heredity 
in the development of any trait as ceasing at birth as it is to date the 
onset of environmental influences from birth. Hereditary factors may 
affect the development of the individual long after birth and, in fact, 
throughout the life span. Inherited susceptibility to various diseases, 
for example, may not be manifested until well past middle ag#. Even 
the age at which a person dies may be determined partly by hereditary 
factors, as suggested by the observation that longevity tends to run in 
families. Hereditary influences may thus become manifest for the first 
time at any age. rh<it environmental influences begin to operate long 
before birth lias already been demonstrated in the discussion of pre- 
natal environment. The influences of heredity and environment are 
co-extensive in time. Birth is not to be regarded as either a beginning 
or an end in the operation of these factors, but as one event in a 
developnicntal continuum which for the individual begins at concep- 
tion and ends at death. 

Resemblance to Parents. Another popular fallacy is the belief 
that heredity means parental resemblance, and vice versa. Both sides 
of this proposition can be shown to be false. That heredity need not 
result in the resemblance of oftspring to immediate forbears is apparent 
from a consideration of the mechanism of heredity. The genes are con- 
tinuous from generation to generation. They are not “produced’' by 

In some ■writini»s, “concenital” is used to signify presence of a characteristic 
at birth, as distinguished from “hercditaiy ” Ihere seems to be no linguistic justified’ 
tion for singling out this particular teim for jcfercnce to birth. 
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the individual parents, but arc simply transmitted by them to their 
offspring. Thus the individual inherits not only from his parents but 
also from all his direct ancestors. A characteristic which has remained 
latent for many generations may become manifest because of a par- 
ticular combination of genes, e.g., two recessives. The result will be 
an individual unlike his parents or immediate forbears in some one 
respect. Instances of this sort are common in family histories. One of 
the most familiar illustrations is that of two brown-eyed parents 
having a blue-eyed child, through the combination of two recessive 
“blue-eye’' genes in the offspring. In such cases, heredity actually 
serves to make the child unlike his parents. 

l'!)e converse proposition, that parent-child resemblance is neces- 
sarily indicative of heredity, is equally untenable. Such resemblances 
may have developed through the many environmental contacts and 
similarities of parent arid child, both prenatally (in relation to the 
mother) and postnatally. Not only are parents and children exposed 
to more nearly similar environments than arc unrelated individuals, 
bnt they constitute in part each other’s environment. Thus mutual 
influence as well common stimulation ma> serve to produce resem- 
blances. For these reasons no parent-child likeness can be attributed 
to hereditary factors without further analysis of its development. 

Inheritance of Acquired Characteristics. The Lamarckian hypoth- 
esis of the inheritance of acquired characteristics has found no support 
either in the experimental tindhigs of genetics or in the data of embry- 
ology regal ding the mechanism of heredity. Yet the popular belief per- 
sists that parents may transmit to their offspring physical as well as 
psychological characteristics which the parents have developed through 
training or experience. For example, t^ s opinion may be expressed 
that if the parents attend college, their chddrcn will as a result “inherit” 
superior mental ability; or that if the parents engage in athletic activi- 
ties, their children will have stronger muscles. Statements are also 
made to the effect that the parents’ acquired fears, interests, preju- 
dices, ethical or aesthetic standards, mechanical skills, and the like, 
may be inherited by the offspring. 

The truth of the matter is, of course, that only conditions which act 
directly upon the gametes, or germ cells, are transmissible to the off- 
spring. It is theoretically possible, to be sure, that certain activities of 
the parents may bring about the operation of effective physical agents 
upon the genes. Exposure to radiation (as from atomic bombs!) 
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would be an example. The action of various types of radiation in pro- 
ducing gene modifications, or mutations, was mentioned in an earlier 
section. Genes arc, however, extremely stable, and the agents which 
affect them very few. Certain other agents, such as alcohol, may injure 
the cytoplasm of the germ cells, thus affecting the development of the 
immediate offspring, but producing no inheritable change which might 
be transmitted to subsequent generations. Such direct physical effects 
on genes or cytoplasm are, however, a far cry from the “transmis- 
sion” of an interest in the classics or a laste for non-objective 
paintings! 

‘"Maternal Impressions.’’ An even more naive notion pertains to 
the influence of the mcUher's experiences during pregnancy upon the 
characteristics of the child. Under this heading would be included cer- 
tain popular explanations of “birthmarks” or the superstition that a 
man may have bushy eyebrows because his mother was frightened 
during pregnancy by a shaggy-haired Airedale! Another favorite illus- 
tration is that of the mother who attends lectures, concerts, and recitals 
during pregnancy in order that her child may acquire a desire for 
“■'culture." All such beliefs are now in the categoiy of superstitions and 
old wives’ tales. ' 

The only prenatal influences which the mother’s activities can exert 
upon the developing offspring arc indirect, biochemical effects. Thus 
certain toxic materials, germs, or any other agents carried by the blood 
stream can be transmitted by the mother to the embryo. Similarly, the 
mother’s general level of metabolism, her nutrition, and her endocrine 
balance may exert considerable inlluence upon the development of the 
embryo. It follows that excessive emotional excitement during preg- 
nancy, for example, may have an indirect effect upon the developing 
child, as a result of chemical changes in the maternal blood stream. But 
there is certainly no basis for expecting specific fears or other experi- 
ences of the mother to have a specific physical or psychological effect 
upon the embryo. 

“structural” and “fUN( TlONAl ” CH ARACTFRISTK S 

Up to this point, we have been discussing the operation of hereditary 
and environmental factors in general, without special reference to psy- 
chology. We may now turn to a consideration of the applieations of 
these concepts to psychological phenomena. The proper domain of 



Heredity and Environment 


121 


psychology is the behavior of individuals. Structural characteristics are 
important in this connection in so far as they impose certain limita- 
tions upon the development of behavior. A cat cannot learn to fly 
because it has no wings. If a child has a defective thyroid, his move- 
ments will be slow and sluggish, and his general behavior dull and 
stupid. For the development of certain types of behavior, vocal organs, 
hands, and a human nervous system arc essential prerequisites. The 
nature and development of bodily structures obviously play a part in 
determining the characteristics of behavior. 

7'he presence of certain structural characteristics should, however, 
be regarded as a necessary but not a siillicicnt condition for the devel- 
opment of any specific type of behavior. In other words, the presence 
of all the structural prerequisities does not in itself insure that the 
given behavior will appear. It also follows that the absence of a given 
type of behavior docs rot necessarily imply a stnictural dellciency, 
nor do behavior vaiiations necessarily imply corresponding structural 
variations. Except for individuals with gross pathologicad defects, the 
structural equipment of most persons is such as to permit an almost 
uniimiled variety of behavior development. 

Much confusion and controversy in discussions of heredity and en- 
vironment in psychology arise from a failure to distinguish between 
behavior characteristics and structural characteristics. Statements re- 
garding the “inheritance” of feeblemindedness, musical talent, mathe- 
matical aptitude, or criminal tendencies are at best highly misleading.^ ^ 
Certainly, no one expects disembodied functions as such to be mysteri- 
ously transmitted Kiroiigh the 'nc >. The genes arc obviously specific 
chemical substances which, through many successive interactions with 
other substances in the environment, eventually bring about the de- 
velopment of the structures making up die individual. No “potential- 
ities,” “tendencies,” “influences,” “determiners,” or other mystical 
entities can be discovered in the genes. 

What, then, can be said regarding the role of heredity in hehavhn? 
Above all, it is clear that hereditary factors cannot aflect behavior 
directly, but only indirectly through th: structural equipment of the 
individual. 'I he immediate question thus resolves itself into a consid- 
eration of the role of structural characteristics in behavior develop- 
mcnld“ In what way arc given behavior characteristics related to 

In many instanoi’s, of course, they arc completely unfoiiudctl. But at this static 
we ate not cons'deiing the factual mater rd. 

■'^Bor a fuller elaboiatron of this point, cf. Anastasi and f'oley (I). 
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structural conditions, such as glandular defects, pathological brain 
conditions, chemical composition of the blood, and the like, and in 
what way are they related to functional conditions, i.e., the individual’s 
previous reactional biography? 

When a specific structural condition is found to be associated with 
a given behavior characteristic, then the question of heredity and en- 
vironment can be raised. If, for instance, a particular behavior de- 
ficiency is shown to be regularly associated with a certain brain con- 
dition, this condition may in turn be traceable to the presence or 
absence of a specific gene or combination of genes. On the other 
hand, the brain condition may result from physical or chemical 
characteristics of the prenatal environment, from birth injuries, or 
from other environmental factors. Lack of one specific gene may 
prevent normal brain development and thereby result in a form 
of feeblemindedness. In such a case, this particular type of feeble- 
mindedness would appear as a simple Mendclian unit in genetic 
studies of family pedigrees. Findings such as these would not, 
however, justify the assertion that “feeblemindedness” is a simple 
Mendelian recessive, as was proposed in some of the earlier psy- 
chological writings (cf., e.g., 9). In the first place, such a finding 
does not imply that only one gene is required for normal mental 3e- 
velopment. Undoubtedly many genes contribute to the structural 
development necessary for “intelligence.” The absence of one or a few 
specific genes may, nevertheless, previent the effect of the others from 
being manifested. Hence a particular defect in a structural character- 
istic may be transmitted as a Mendclian unit, although the character- 
istic itself depends upon the combined effect of a large number of 
genes. In the second place, the presence of all the required genes 
would not insure normal intelligence. Intellectual development — as 
all psychological development — depends upon the individual’s reac- 
tional biography, viz., upon what he does with his structural 
equipment. 


THE CONCEPT OF “UNLEARNED BEHAVIOR” 

One of the major sources of confusion and controversy in psycholo- 
gists’ discussions of heredity and environment centers around the con- 
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cept of “unlearned behavior.” Among the criteria for the identification 
of such unlearned behavior which have been proposed from time to 
time may be mentioned; universality within a species, uniformity 
among different members of the species, sudden appearance without 
subsequent change, uniformity of developmental sequences in those 
cases in which change does occur, and “adaptiveness” or effectiveness 
far in excess of that which could reasonably be expected from the 
animal’s own learning. Objections have been raised to each of these 
criteria (cf., e.g., 17; 20, Ch. Ill), the principal criticism being that 
behavior which meets any or all of these specifications can and does 
at times develop through learning. 

The only completely dependable criterion of unlearned behavior is 
the demonstrated absence of the opportunity to learn. If this criterion 
is applied, instances of unlearned behavior can still be found in various 
species, the clearest illustrations being furnished by the behavior of 
certain insects. In such illustrations, highly uniform and complex series 
of activities are performed despite the fact that the animal has had no 
previous contact with other members of the species or with the objects 
toward which the behavior is manifested. In many such species the 
parents die or abandon the eggs long before they are hatched. Thus 
the offspring have no opportunity to learn by observing the parent’s 
behavior, nor does the parent have any opportunity to observe the 
effect of its preparatory activities upon the offspring. 

A favorite illustration of such unlearned behavior is the frequently 
cited pollinating behavior of the yucca moth. As soon as this insect 
emerges from its chrysalis, it t-uvcls to a yucca flower, from which it 
obtains pollen. It then finds another yucca flower, where it deposits its 
eggs as well as the newly gathered pollen, following a highly stereo- 
typed sequence of reactions. The fertilised ovules of the flower, which 
result from this pollination, provide food for the yucca larvae when 
they emerge from the eggs four or five days later. In commenting 
upon the unlcarn?d nature of this pollinating behavior. Stone (27) 
has written: 

The adult does not partake of the pollei: which it gathers and probably 
obtains no nourishment at all from the plant while pei forming this round 
of complicated activities. . • . The adult insect docs not learn this com- 
plicated series of acts through imitation of its parents, long since dead, or 
from contemporaries cither, for its visual receptors do not provide the 
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kind of vision necessary to the human concept of visual guidance. Action 
systems of the larvae are totally unlike those of adults, and the activities 
arc even performed with different appendages. The body of this same larva 
that descended the silken thread to bury itself in the ground is dedifferen- 
tiated and resynthesized during the resting state, and a prolonged interval 
of lime, the winter season, Luervenes between the last act of the larva and 
the first of the adult. In view of these facts, no concept of memory or 
transfer of training supported by experimental evidence can be invoked 
to account for the behavior of the yucca moth (?7, p. 46). 

Obviously, “unlearned behavior” can only mean behavior which is 
determined wholly by the structural characteristics of the organism, 
such that the mere presence of the necessary structures at a certain 
stage of development insures the appearance of the behavior in ques- 
tion. Merely to say that a certain type of behavior is unlearned, how- 
ever, is no answer to the question of how it develops. Such a statement 
only reformulates the problem, so that the question still remains to be 
answered. The answ'cr now calls for knowledge of what structural fac- 
tors determine such behavior and how^ they operate. To prove that 
behavior is unlearned, i.c., not learned, is a negative finding, ♦which 
furnishes no positive information. It docs not in itself tell us how*^ the 
behavior develops. To call such unlearned behavior “instinctive,” 
“innate,” or “hereditary” simply obfuscates the problem, because 
these icrms seem to suggest positive explanations or active processes, 
whereas in this case tlicv are being used only as syfuniy/ns fiU’ the 
negative tertn 'Unilearned.'* 

The same dilficulty arises in the common use of the term "‘niatura- 
tlon' in psychological writings. In discussions of the origin of behavior, 
a distinction is usually made between development through learning 
and deveiopment through maturation. The latter refers to the sudden 
appearance of certain behavior, regardless of the previous activities of 
the organism, as soon as the requisite stage of structinal development 
is attained. ITis term is misleading for several reasons. It suggests a 
positive process of behavior development, without making it suffi- 
ciently clear that it is the structures that are developing. Moreover, 
certain writers who use the term “maturation” easily slip into the 
implication that such behavior results from an “unfolding of poten- 
tialities” which were present in the genes, and that it is therefore 
inherited. 
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Strictly speaking, it is incorrect to regard unlearned behavior as 
hereditary. In the first place, behavior cannot be inherited as such. 
It is only structural characteristics which can be directly influenced by 
the genes. In the second place, the structural conditions which deter- 
mine such unlearned behavior may themselves result from either 
hereditary or environmental factors, or varying combinations of 
the two. 

Certain psychologists maintain that structurally determined or “un- 
learned*” behavior falls outside of the scope of psychology. This is the 
position taken by Kanlor (16, Ch. IV), for example, who holds that 
biological functioning follows directly from the structural properties 
<.»f the organism and the physical characteristics of the stimulus, 
whereas psychological functioning depends upon the individual’s pre- 
vious interactions with stimuli. Other psychologists would be reluc- 
tant to exclude consideration of such unlearned, structurally deter- 
mined behavior from (lie proper domain of psychology; some have, 
in fact, devoted virtually all their research to its study. Whether one 
defines psychology so as to include stiuelurally determined behavior, 
or whether one insists that all such behavior belongs under the head- 
ing of biology is in itself only a question of division of labor or per- 
sonal interest. The essential point is to have a clear and unambiguous 
i.nderstanding of wliat is meant by unlearned behavior and to avoid 
ill-defined, mystical implications in its discussion. 

“Unlearned behavior” has been traditionafly subdivided into such 
categories as tropism, reflex, and instinct. These distinctions arc not 
sharply drawn. Some writers have, in fact, used one or another of 
these terms exclusively to designate all unlearned behavior. The most 
common usage, however, is to dcsigrate as iropisiic any behavior which 
is primarily an orientint (turning, approach, withdrawal) response of 
the entire organism toward a stimulus, such a response being essen- 
tially ‘^forced” b\ the physical and chemical properties of the stimulus 
and of the reacting organism. An example is the turning and bending 
of plants toward the sun or other source of light. '^Reflex' generally 
refers to a specific response of a pai. of the organism to a particular 
form of stimulation. The term is usually applied only to organisms 
which have a synaptic nervous system. The structural basis of the 

It shotiia be added that the slimidus itself is defined difTercntly by Kantor in 
these i\\o situations luf, 15, m’* I. C'h. 11). 
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reflex is the “reflex arc,” consisting of receptor, neurones, and effector. 
Two examples in man are the patellar reflex, or “knee jerk,” and the 
pupillary reflex, or contraction and expansion of the pupil as intensity 
of illumination changes. 

The term ^'instinct” has been used with more varied meanings, al- 
though nearly all its definitions imply a greater complexity of behavior 
than is represented by either “tropism” or “reflex.” Some use the term 
to refer to a chain or integration of reflexes, as illustrated by the 
complex stereotyped sequences of unlearned activities observed in cer- 
tain insects, such as the yucca moth cited above. Others use the term 
in a vaguer sense to mean a relatively rough framework within which 
considerable variability of specific behavior may occur. In such defi- 
nitions, instinct is often related to physiological needs, such as the need 
for food or water, and to the presence of hormones. It is this latter, less 
specific use of the term “instinct” that has opened the way for many 
unbridled leaps into an improbable terrain. It is here, for example, that 
one finds discussions of gregarious or collecting "Instincts” and the 
like. Not only have the structural properties leading to gregariousness 
or collecting behavior never been identified or even vaguely ^lesscd, 
but the nature of this behavior is also such as to make the search for 
its structural correlates appear futile and meaningless. 

It is undoubtedly true that isolated instances of behavior can be 
found which clearly fit the definitions of tropism, reflex, or instinct. 
On the other hand, most behavior — human or infrahuman — cannot be 
classified into any one of these categories. Certain segments or aspects 
of a complex activity could probably be described as tropistic, re- 
flexive, instinctive, or learned, the activity itself including more than 
one of these various components. It would seem, moreover, that these 
terms, as well as the term “maturation,” lend themselves too readily 
to misunderstanding and unwarranted implications. To say that a 
given activity or a particular component of an activity is unlearned 
(provided it has been conclusively demonstrated to be unlearned) is 
certainly a more precise and objective description of the actual obser- 
vations. To call such an activity structurally determined adds to the 
observation the only possible source of the occurrence of such be- 
havior. At the same time, the designation “structurally determined” 
centers attention on the question which logically follows, viz., what 
structures are involved and how do they bring about such behavior? 
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METHODS FOR THE STUDY OF HEREDITY 

AND ENVIRONMENT 

We may conclude this preliminary introduction to the heredity-en- 
vironment problem with a brief overview of the methodology which 
has been developed for its study. Psychologists have followed many 
and varied approaches in their efforts to disentangle the factors which 
underlie behavior development. Some of the methods yield results 
which are highly ambiguous and difficult to interpret. Few approaches 
can give conclusive answers by themselves. Each has its own peculiar 
advantages and limitations. Frequent resort is made to the combination 
of methods and the mutual corroboration of data in the attempt to 
remedy the shortcomings of any one technique. A few of the methods 
winch will be outlined below were not specifically designed for the 
study of heredity and environment. For example, investigations of sex 
differences and racial differences are conducted primarily because of 
an interest in the psychological characteristics of the specific groups 
under consideration. Some of these studies are nevertheless set up in 
such a way as to contribute toward an analysis of the factors determin- 
ing behavior development, and they have been included in the present 
listing for this reason. 

It has often been pointed out that the crucial psychological experi- 
ment on heredity and environment has yet to be done. The chief diffi- 
culty confronting the investigator in this field is that ol isolating the 
inffuence of hereditary ar..' environmental factors. As in all experi- 
mental design, the essential prerequisite is the control of conditions in 
such a way that all variables shall be constant except the one whose 
influence is being investigated. Since in most investigations on individ- 
ual and group differences, hereditary and environmental factors have 
varied simultaneously, the results are incapable of definitive inter- 
pretation. 

If heredity can be assumed to be constant, as in the case of identical 
twins, then differences can be attributed unambiguously to environ- 
ment. Similarly, if environment is held constant, any observed differ- 
ences must be the result of hereditary influences. In view of our dis- 

survey of promising methodological resources of which little or no use has 
so far been made is given bv Stone (28). Most of the methods discussed by Stonn 
are limited to i datively simple animal foims. 
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cussion of what constitutes the effective environment, however, it 
should be apparent that it is extremely difficult to hold environment 
constant for two individuals, especially for human subjects. A few of 
the techniques employed clearly make no attempt to separate heredi- 
tary and environmental factors, and consequently yield results which 
arc at best descriptive, and not explanatory. On the other hand, some 
of the approaches make it possible to segregate, at least partially, the 
relative contribution of heredity and environment to the development 
of individual differences in specific characteristics. 

For convenience, the principal methods employed by psychologists 
in studying the heredity-environment problem have been grouped into 
fourteen categories. A discussion of each of these methods, tc:>gcther 
with illustrative data, will be found in the appropriate chapters in 
Parts II and 111. These chapters have been indicated in parentheses 
next to each category. 

Since the now famous experiments of Mendel, geneticists have made 
constant use of seleclivc breeding (Ch. 5) to investigate the inherit- 
ance of structural characters. In recent years the method has been 
applied to the study of psychological characteristics. Laboratewy rats, 
for example, have been bred for maze-learning proficiency and other 
behavior characteristics. This method is obviously not feasible with 
human subjects. 

blonnathe developmental studies (Ch. 5 and 9) are observational 
studies of the course of behavior development in the growing organ- 
ism. Regularity of sequence in dev'clopmental stages is of special 
interest in connection wath the heredity-environment problem. Obser- 
vations of this type have been made on both infrahuman and human 
subjects, and during prenatal as well as postnatal periods. Studies of 
the ‘'growth curve” of various behavior functions, and of tlie decline 
of functions with age may be regarded as extensions of this method to 
later age levels (Ch. 9). A closely related approacli is the study of the 
structural correlates of behavior development (Ch. 5). This method 
has been used principally with lower animals and during the prenatal 
stages, although it is also applicable to a limited extent postnatally and 
with human subjects. By means of this method, the first appearance 
of certain behavior functions, such as specific types of movenvmt, may 
be linked with a particular phase of development in the nervous system 
or other bodily structures, and subsequent behavior development may 
be traced in conjunction with structural changes. 
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One of the most direct approaches to the heredity-environment 
problem is the experimental variation of environmental conditions. 
This approach is illustrated by the artificial prevention of the exercise 
of a function and the subsequent observation of the effects of such 
deprivation upon the development of the function (Ch. 6). Sometimes 
the experimental variation consists in giving additional exercise or 
training in a particular function, in order to determine the extent to 
which the normally observed course of development may thereby be 
altered. Certain investigations take advantage of a sort of “unplanned 
experiment” of this type afforded by the varied infant-rearin^q practices 
(■) different cultures (Ch. 6). For example, the prevention of locomo- 
tion or of the exercise of other motor functions, beyond the age when 
such functions are well developed in other cultures, permits an analysis 
of the relative dependence of these functions upon structural growth 
and upon exercise. A similar ^unplanned experiment” is furnished by 
the cases of so-called feral man (Ch. 6). Ihis is a term applied to 
children who have apparcnlly been isolated from human beings at an 
early age and have either been reared by animals or have shifted for 
themselves in the absence of any companions. 

A much more restricted type of training experiment is represented 
by investigations of the effects of practice and coacfiinq on mental test 
performance (Ch. 7). Such studies cover the influence of practice 
upon the extent of individual dill'crcnces, as well as the effects of 
coaching or of the repetition of a test upon the general level of per- 
formance. Tn recent years, considerable attention has been attracted 
by a related type of investigation, concerned with the effects of school- 
ing upon niental test performance { 8). A large number of these 

studies have been concerned with the possible increase in 10 following 
nursery school attendance, although a few have dealt with the influ- 
ence of education at the elementary school and higher scholastic levels. 

Family resemblances and differences liave long been a favorite 
method for the study of heredity and environment, although this ap- 
proach is beset with many difticulties 'md its results are likely to prove 
ambiguous. The study of family history (Ch. 10), introduced by 
geneticists, is most fruitful when applied to relatively simple character- 
istics. Very limited use has been made of it in the study of psycho- 
logical characteristics; some of the applications of the method in this 
area are open to serious criticism. Intrafamilial correlations (Ch. 10) 
have been frecjucntly computed with the results of mental tests admin- 
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istercd to parents and children, siblings, and other related individuals. 
The study of twins and foster children (Ch. 11) offers the oppor- 
tunity for a more direct analysis of the influence of hereditary and 
environmental factors. 

Investigations of the relationship between individual differences in 
structural and behavioral characteristics (Ch. 12) may suggest cer- 
tain physical correlates of individual differences in behavior, which 
could in turn be traced to hereditary or environmental factors. This 
method is to be distinguished from the direct study of the structural 
correlates of behavior development, cited earlier. The present method 
is a statistical rather than a developmental one, and has generally been 
applied to fairly complex functions in the human adult. Typology, or 
the search for “constitutional types” with structural as well as psycho- 
logical differentia, may be included under this heading (Ch. 13). 

The comparison of socio-economic groups (Ch. 23), including oc- 
cupational levels, urban and rural groups, individuals living in isolated 
or “culturally backward” communities, and the like, represents an- 
other approach to the problem. A related method is the cross-compari- 
son of cultural and biological groupings (Ch. 22). Groups which arc 
biologically differentiated, such as the two sexes or different races, 
have been compared on a wide variety of psychological tests. Of 
special interest, however, arc those investigations in which “cross- 
comparisons” can be made between such biological (i.c., hereditary) 
groupings and the cultural (i.c., environmental) groupings which cut 
across them. 

At this stage in our treatment of the problem, we have considered 
the concepts of heredity and environment, the complex relationships 
between hereditary and environmental factors, popular misconceptions 
in this area, and the importance of distinguishing between structural 
and functional characteristics. Various implications of the concept of 
unlearned behavior have also been examined, followed by an introduc- 
tory listing of the methods used by psychologists in studying the hered- 
ity-environment problem. From this preliminary survey, it should be 
apparent that the problem is by no means a simple one. Alluring gen- 
eralizations can only mislead in a topic which is intrinsically complex. 
If this discussion has given the reader some conception of the com- 
plexity of the heredity-environment relationship, it has served its pur- 
pose well. Moreover, if the reader has come to recognize the 
importance of careful use of terms, to distinguish between superstition 
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and established fact, and to follow deductions logically and objectively 
in the heredity-environment area, he will have made significant strides 
in his thinking. An honest, forthright recognition of the complexity 
and inherent difficulties of the problem, as well as the liijiitations of 
our present knowledge in this field, is to be preferred to a list of 
glossy oversimplifications. 
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Biological Factors in Simple 
Behavior Development 


Part i served as an introduction to the basic concepts and meth- 
odology of dilfcrcntiai psychology. We may now consider some of the 
principal findings regarding the nature and sources of individual differ- 
ences. Throughout the chapters which follow, the persistent question 
of hercdity-and-environment will be repeatedly encountered. The 
various methods for the study of this question, outlined in the pre- 
ceding chapter, will be treated in the remainder of the book in connec- 
tion with the topics to which they are most relevant. The organization 
of the following chapters is based primarily upon the traditional areas 
of investigation within differential psychology rather than upon a logi- 
cal analysis of the problems. Such an organization was obviously 
necessitated by the available results. 

In the present chapter, ij^pical findings pertaining to the develop- 
ment of simple behavior functions will be considered. The studies to 
be included arc classifiable under th first three of the methods listed 
in Chapter 4. Although these methods have many points of difference, 
and vsome could more logically be combined with methods to be treated 
in later chapters, they have been brought together in this chapter 
because of certain common features which make their joint considera- 
tion convenient. In the first place, these approaches have been con- 
cerned with relatively simple behavk^ ^o the almost complete exclu- 
sion of the more complex linguistic and other symbolic activities. 
These methods thus make virtually no use of psychological tests, which 
have played so large a part in many of the other approaches. Since 
special methodological problems arc presented by psychological tests, 
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it is more expedient to consider separately those investigations which 
do and those which do not employ testsJ 

In the second place, the studies to be considered in the present chap- 
ter have dealt largely with relatively simple organisms, viz., those at 
lower phylogenetic or ontogenetic levels. Many of the observations 
have been made on infrahuman animals. In those investigations em- 
ploying human subjects, the factors studied operated at an early age, 
prenatally or during infancy or early childhi^od. A third feature which 
characterizes the present group of studies is their emphasis upon the 
biological or structural conditions underlying behavioral differences. 
Thus selective breeding, the charting of progressive age changes in 
behavior, and the observation of structural changes which parallel 
behavior changes may all be regarded as ways of determining the effect 
of structural conditions upon behavior development. T(; be sure, psy- 
chological differences in the stimulating conditions arc also present 
when comparing, for example, organisms at dilferent age levels, but 
the emphasis in the present approaches has been put upon the biologi- 
cal conditions. 

SELCCTIVI-. URUrniNG 1 OR BUHAVIOR CHARACTnRISTK'S 

The experimental breeding of animals selected on the basis of behavior 
characteristics is a recent application by psychologists of the technique 
of selective breeding long in use by geneticists. T his is the basic method 
of genetics for the study of the inheritance of any characteristic. 
Through several refinements of this method (as in the analysis of cross- 
overs and ‘‘linkage groups"), geneticists have succeeded in analyzing 
the hereditary basis of many structural characteristics and even in 
constructing theoretical “gene maps” for certain species. The present 
applications of selective breeding to behavior phenomena, however, arc 
far from reaching such refinements. All that these studies have 
achieved to date is to effect through successive generations of selective 
breeding, the development of two strains which differ signitieantly in a 
given behavior characterisiic. Following the establishment of the con- 
trasted strains, a beginning has been made in the search for possible 
structural bases for such behavior differences between strains. 

^ The method of selective breeding, for example, could lopically be combined 
with studies on familial lesemblanccs and dilTerences, to be treated in Chapter 10, but 
the latter approach makes extensive use ot psychological tests. 
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One of the most extensive of these investigations employing selective 
breeding is that conducted by Tryon (51, 52, 53) on maze learning in 
white rats. An initial group of 142 rats were given 19 trials in running 
a maze, and the number of “errors,” i.e., entrances into blind alleys, 
was determined for each animal. The group exhibited wide individual 
differences in maze-learning ability, the total number of blind-alley 
entrances in 19 trials ranging from 7 to 214. On the basis of these 
scores, a group of the brightest and a group of the dullest rats were 
selected for experimental mating. The “bright” rats in this parent 
generation (P) were mated with each other, and the “dull” were like- 
wise mated together. This procedure was followed through 18 filial 
generations (FI to FI 8). In each successive generation, the “bright- 
est” rats were selected in terms of maze performance and were bred 
together, the “dullest” being similarly selected and interbred.- Environ- 
mental conditions, such as food, lighting, temperature, and living 
quarters, were kept constant for all rats in the different generations. 

The effect of such selective breeding upon maze perlormance is 
illustrated in Figure 37. The distribution curves indicate the percent- 
age of rats in each gioup making the number of errors shown on the 
baseline. It will be noted that the distributions ol the bright and dull 
sub-groups gradually separate until there is virtually no overlapping 
between them when the F7 generation is reached. Beyond the seventh 
generation, the additional effects of selective breeding arc negligible. 
In subsequent generations individual differences viilhin the bright and 
dull groups remain practically unchanged, and the dilleientiation be- 
tween the two groups shows no appreciable increase. When rats from 
the bright and dull groups were interbred, a distribution similar to that 
of the original parental group resulted, most of the animals now ob- 
taining intermediate scores, with relatively few at the dull and bright 
extremes. The distributions of the bright and dull parental groups and 
of two cross-bred filial generations arc given in Figure 38. Tryon sug- 
gests that the results of this cross-breeding experiment arc consistent 
with the hypothesis of multiple factor inheiitance, some of the factors 
being “dominant for bright performance, some (but fewer) dominant 
for dull, and some cumulative” in their effect (53, p. 116). 

Extensive analyses of the characteristics of the bright and dull 

2 After the FI geneiation, a modified ‘‘progeny test” was applied in selecting 
individuals for breeding, the individual being Uassilied not only in terms of his own 
maze performance, but also on the basis oi the performance of his forbears. 
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Number of Errors i Maze Learning 
J-'Bright” Strain, --Dull” Strain 

Fig. 38. The Effect of Cross-Breeding Rats from “Blight” and “Dull*' 
Strains. (From Tryon, 53, . 1J5.) 
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groups have demonstrated consistency of performance level both when 
retests were made at later ages and when the error scores, in terms of 
which the selective breeding was conducted, were compared with other 
aspects of the animal’s maze performance. The evidence does not, 
however, support the view that the animals differed in ''general learn- 
ing capacity/' the difference between the two genetically contrasted 
groups being specific to the maze situation. Such specificity of learning 
behavior has been corroborated by other investigators. Experiments 
designed to disrupt various sensory cues available in the maze learning 
situation showed relatively negligible effects on the performance of the 
bright rats, many showing no disturbance at all following cue disrup- 
tion. The performance of the bright rats thus appeared to be largely 
independent of possible differences in sensory acuity. An analysis of 
the frequency of errors in each alley showed a consistent dilferencc in 
the pattern oj error frecpicncics between the two groups, a difference 
which persisted, on the whole, when another maze was employed. Such 
a finding suggests a possible dilferencc in the animals’ approach to the 
solution of the maze. 

Differences in entotionality were aKo found between the omntrasted 
groups, the bright animals showing less emotional response, such as 
hiding, avoidance, escape, and vocalization reactions, when enc(uinlcr- 
ing novel inanimate objects in the maze, but more neurotic behavior in 
response to handling, rhe bright grtaip was significantly superior in 
certain physical characteristics, surpassing the dull group in brain size, 
brain weight, and body weight.*’ 1 he dull group, on the other hand, 
excelled in fertility, 'f ryon regards the interpretation of these physical 
differences equivocal ‘‘because of the intensive inbreeding that has 
occurred during the process of selective breeding’’ (53, p. 116). 

Strains of “maze-bright” and “maze-dull” rats have been bred by 
Heron (23, 43) through the sixteenth filial generation. In this study, 
no significant differences in brain w eight or in ratio of brain weight to 
body weight were found between the two contrasted groups ^ (43). 
Differences in speed of running ( 23 ). however, favored the bright rats, 
A further analysis (22) indicated that the group which excelled in 
maze performance manifested a higher level of general activity and 

Since brain \> closely corielated with body wcitdit, it is unf irtunale that 

ratios of brain weight to body v/emhl arc not j^ivcn 

In fact, most of the comparisons tended to show greater brain size among the 
dull rats, but the mean differences were all small, the largest being 2.21 times its 
standard error 
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stronger motivation. Observation of behavior in the maze situation 
corroborated the hypothesis that such dilfcrcnccs operated in the maze 
learning, the dull rats being described as behaving more like non- 
hungry rats or hungry rats after the food has been withdrawn from a 
previously learned maze. These ditlcrences were not. however, suffi- 
cient to account for the entire difference in maze learning, and other 
factors very probably contributed to the level of maze performance 
of the two groups. 

Several groups of animals have been directly bred for emotionality 
and for general activiiy level. Rundquist (39) selected rats on the 
br .is o\ sponiancous activity as measured by performance on a rotating 
drum. Selection was carried out through the fifth generation solely on 
the basis of individual activity; beyond that the extreme individuals 
within the active and luactive strains, respectively, were bred, with no 
crossing between the two strains. At the FI 2 generation, the two 
strains were well separated in activity level. This investigator reports 
that activity level showed little, if any, relationship to maze learning. 
A follow-up study of the same two strains is reported by Brody (3). 
Selective breeding of these animals until the F29 generation led to a 
marked decrease in the mean activity, as well as in individual differ- 
ences in acLi\ ity level, in the inactive strain, but no change in the active 
strain. Apparently, active rats w^erc eliminated from the inactive strain, 
but not inactive rats from the active strain. From the results of strain 
crosses and back crosses. Brody proposed a specific hypothesis re- 
garding the genetic transinis., :n the factor determining activity level 
in rats.'* The author also pc^ints out, however, that environmental con- 
ditions seem to obscure the segregali n of hereditary factors in some 
of the cross-matings. 

Hall (19) obtained sliains of ct notional and unemotional rats by 
selective breeding. vSiibsequent observation of these two strains showed 
that the emotional rats tend to have a lower activity level in free sit- 
uations, and to be more variable or less stereotyped in a situation 
calling for choice. In a later study ('^-' on the same strains, it was 

•’ Si)ecinc'i11v, Brody conclude^^ that- . . the two strains differ with respect to a 
gme ralhci than with res^vi .1 to multiple factors. . . . Ihe gene apparently 
behaves as a dominant in the males and as a recessive in the females. . . . The gene 
which dctcimii es inactivity must act as an inlii'o tor since none of the matings within 
the inactive st/ain produce act-v' offspring, but, on the olhei hand, active strain 
matings produce individuals which vary from extreme inactivity to extreme activity” 
(^, pp^. 23 24) 
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found that the rats from the non-emotional, fearless strain were con- 
siderably more aggressive than those from the emotional, timid strain. 
This result suggests that aggressiveness may be related to genetic fac- 
tors. An investigation of fighting behavior of male mice (40) also 
showed sharp differences between inbred strains. 

In evaluating such studies on the genetic bases of emotional re- 
sponses, it should be noted that a number of other investigations have 
suggested the dependence of emotional and '‘neurotic'’ behavior in the 
rat upon certain environmental conditions, such as diet, “taming,” and 
previous relations with cage-mates (cf., e.g., 12, p. 223; 18; 38). To 
demonstrate that a certain phenomenon depends upon heredity does 
not, of course, preclude its dependence upon environmental factors, 
and vice versa. 

The researches of Stockard (48) and his collaborators on different 
breeds of dogs are relevant to the present approach, although in these 
studies the animals were not bred directly for behavior characteristics. 
Two groups of dogs were chosen which present a strikingl> different 
picture both structurally and bchaviorally. One group, consisting of 
basset hounds, was characteristically inactive and lethargic; 4he other, 
including German shepherd and Saluki dogs, represented the opposite 
extreme of activity and alertness. Consistent and clear-cut differences 
in the behavior of these two gpups were observed in a series of con- 
ditioning experiments. When only members of the same breed were 
mated, successive generations were found to “breed true” for both 
morphological and behavior characteristics, i.e., the offspring showed 
the same characteristic structural and behavioral pattern as the parent 
generations. Cross-breeding yielded a distribution of behavior types 
in the F I and F'‘2 generations consistent with the hypothesis of mul- 
tiple-factor inheritance. 

Differences in endocrine activity between the breeds were suggested 
as a likely basis for the observed behavior differences. The investiga- 
tors point out that the basset hound has a relatively inactive thyroid, 
giving the animal a low metabolism, I his condition, together with the 
correspondingly low activity of the other glands, is probably a factor 
in the animal’s characteristic inactivity and lethargy. 1'he German 
shepherds and Saluki, on the other hand, have highly active thyroids. 
Considerable evidence confirming this explanation is furnished by an 
extensive series of investigations involving the removal of various en- 
docrine glands, as well as the experimental administration of glandular 
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extracts. The subjects were dogs of known genetic history, mostly from 
the F2 generation of the above-mentioned crosses. In behavior, they 
were intermediate between the two extreme types. Through experi- 
mental glandular control, their behavior could be made to vary in the 
direction of either of the two extreme genetic types. 

In summary, it may be noted that through selective breeding it is 
possible to produce strains which are clearly differentiated and sharply 
contrasted in such behavior characteristics as activity level, emotion- 
alit>, and maze learning. Some data are also available regarding the 
structural characteristics which underlie these behavior differences. 
Glandular conditions, body size, brain size, and factors related to 
health, vigor, and strength of the hunger drive are suggested as possible 
bases for the strain differences in learning behavior. Comparisons of 
the physical charactei istics of the contrasted strains sometimes yield 
inconsistent results, one investigator finding a significant difference in 
a particular physical characteristic, while another finds no difference 
in the same characteristic. These inconsistencies are not, however, 
unexpected if activities such as maze learning are influenced by a 
multiplicity of structural conditions. For example, if we suppose that 
maze learning can be facilitated by six different structural factors 
(a, b, c, d, e, and f ), an individual experimenter who selects good maze 
performers in the parental generation may get rats which by chance 
excel in four of these six relevant structural characteristics (a, b, c, 
and d). The extensive inbreeding which follows in successive genera- 
tions will augment these pat uLr structural differences, since in effect 
the experimenter was selecting the animals in terms of these charac- 
teristics even though he may have b 'en unaware of it. By the same 
token, another investigator w'ho singhs out his “maze-bright” rats for 
mating may be selecting them in terms of structural characteristics 
d, e, and f. In that case, successive generations of selective breeding 
will produce strains differentiated in d, e, and f, but not in a, b, and c 

THE NORMATIVE DEVELOP’ TENT AL STUDY OF BEHAVIOR 

Charting the course of behavior development is of considerable theo- 
retical as well as practical interest in its own right. In the present con- 
nection, however, we are concerned OTily with the use which has been 
made of such studies in an attack upon the factors which determine 
behavior differences. In examining the “stream of behavior” as it 
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appears in the growing organism, investigators have looked for any 
clues concerning the mechanism of behavior development Thus, the 
'sudden emen^ence ’ of functions and the ''sequential patterning'' of 
development have been commonly regarded as CMdencc for the im- 
portance of maturationcil lactor*- It has been repeatedly argued that 
behavior which appears suddenly, in more or less final form, when the 
organism has reached a certain age, is unlearned The uniloimity ol 
dc\elopmental stages, or sequential patterning, in any partieular tunc- 
tion has likewise been eitcd as a criterion ol unlearned bcluiMor, on 
the grounds that opportunities for learning are likely to varv from one 
indi>idual to another and eould not result in such a consistent succes- 
sion ol like stages We shall consider these cnteria in the light of some 
of the data to be repented below 

r)bscr\ations of the normal course of bchaMor devi lopment hive 
been made e\tcnsi\cl\ on infraluiman as wtll as human subjects, and 
during pienatal iis well as posln Pal cle\ clopim. nt The iclative sim- 
plicit} of tlie processes in lowei ioims fae litales the iccoumlion of the 
essential eharactcristies ot development veho e applieabilitv to higher 
foims can then be moic read'lv 'studied Similarly some oWtln most 
''igmficant observations have been made at the prenatal lud neon it d ^ 
stages partly because of the gieater simplieitv of behavior at these 
levels and partly because environmental diveisities and opportunities 
for learn ng aie not so gieat is in the e ise ol older sub)e"ets fo be 
sure investigations oi. the growth euive’ of psychological functions 
and on the giowtli and dee line ot intciligenec also logically belong 
under the heading of normative developmental studies Methodoiogi- 
cailv, however, sueh i iv«.stigations hive much more m common with 
other approaches emploving psychologic il tests and can therelore be 
more clleetivelv evaluated in connection with the latter (tt Ch 9) 

The studies to be discussed in the. following se».tiem repiestnt only 
a few outstanding investigations of the behavioi development of intia- 
human subjects, a field in which a wealth ot data is graduallv accu- 
mulating The iivailable information on human fetal development (1o 
be considered in a later section) is relatively less extensive although 
it appears to be remarkably fidl in the light of the methodological 
difliculties involved in its acquisition fn the study of the prenatal 
behavior of infrahuman organisms, a number of methods are available 

^ The term “ncornte” commonl> rctus to <hc child heUcen biith md ipproxi- 
m itcly one month of jgt let Pr ilt V6) 
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for well-controlled and prolonged observation. In animals such as 
salamanders and frogs, which pass through a larval stage, the inde- 
pendent-living, immature organism can be directly observed. Bird 
embryos have been studied through a transparent window made in the 
shell of the egg. In marsupials, such as the kangaroo, the immature 
fetus completes its development in the external pouch of the mother, 
where it is readily visible. Mammals, such as the guinea pig, rat, and 
cat, have been studied by removing the ictus and keeping it in a physi- 
ological saline solution while it is still attached to the mother’s body. 
By this method the fetus may be kept alive long enough to permit 
re’ lively extensive observations. 

wSuch experimental procedures are obviously impossible with human 
subjects. Some scattered information regarding fetal movements can 
be obtained from intiospcctive reports of the mother, as well as 
through instrumental observation, as with a stethoscope or recording 
tvimbour. The principal source of data on human fetal behavior, how- 
ever, is furnished by fetuses removed from the mother by Caeaarcan 
section, when the health of the mother necessitated such an operation 
Under these conditions, the fetus has no source of oxygen and can 
be kept alive for only a relatively short time. It is during this brief 
period that the beha\ior observations must therefore be made. More- 
over, the behavior of such a fetus will be influenced by the fact that 
during this time the fetus is gradually dving from lack of oxygen. The 
probable effect of this condition is initial overactivity followed by 
underactivity. Finally, the s. all number of fctusc.^ available for such 
observations and the unsystematic, uncontrolled variation in their ages 
further increase the difficulties of ob^ lining a clear, coordinated pic- 
ture of behavior development in the human fetus. 

The study of postnatal behavior development in human infants 
obviously presents no such methodological difficulties. The obtaining 
of adequate and representative samplings of subjects for observation 
at these early ages is, however, considerably more difficult than when 
children of school age arc employed records must be precisely 
reported (preferably in terms of days) owung to the rapid rate of early 
development. The employment of standardized equipment, including 
toys, cribs, chairs, stairs, etc., is essential if comparisons among dif- 
ferent subjects arc to be made. Another important methodological re- 
finement is the use of a one-way-vision screen to eliminate the effects 
of stimulation by adult Motion pictures have frequently 
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been employed for detailed and unambiguous recording of behavior. 
Since so much of the behavior investigated at these age levels is motor, 
photographic techniques arc well adapted for such records. In the 
development of most of these methodological procedures for the 
observation of infant behavior, Gesell and his co-workers have made 
pioneer contributions (cf. 13, i4, 15, 16). 

niaiAVlOR Di:VhLOI>MENT »N INFRAIIUMAN SUBJECTS 

Although a considerable body of data had been accumulated by earliei 
investigators (cf. 4, 37), the researches conduct'd by Coghill and his 
co-workers during the first three decades of the present century rep- 
resent a major turning point in the study of early behavior develop- 
ment. CoghiU’s work exerted a profound influence upon both the 
theory and the experimental study of behavior development. One of 
the most intensive and best known of Coghilks studies was conducied 
on embryos of the salamander /4/n/?/v.s7o/nu (cf. 7 ) . In his observations 
of the movements made by this animal in response to stimulation by a 
fine hair, Coghill noted a uniform succession of stages. At first tlie 
animal is non-motilc, giving no observable response to Ihc stimulus. 
The earliest movement is a bending of the head to the right or left. In 
older embryos, this develops into a bending of the entire trunk, mak- 
ing the animal resemble the letter C. Still later, this C-reaction becomes 
exaggerated and the animal bends into a tight coil when stimulated. 
Finally, an S-reaction appears, as a combination of two successive 
and overlapping C-rcactions in reverse directions. Thus, for example, 
the first C-rcaction, toward the left, begins at the head end and travels 
by progressive muscular contractions toward the tail end; but before 
this reaction reaches the tail, a second C-rcaction, this time toward 
the right, begins at the head end. When these wS-rcactions follow' each 
other in rapid succession, their performance exerts pressure upon the 
water and enables the animal to swim away. Different stages in this 
behavior sequence are illustrated in Figure 39. 

The Coghillian S-reaction, with its characteristic antecedent stages, 
is held by many investigators to be fundamental in the locomotor de- 
velopment of many animal forms. Not only has it been observed in the 
swimming movements of numerous aquatic animals, but it also appears 
in the locomotion of land animals. It can be recognized, for example, 
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in the motion of the immature opossum or kangaroo as it travels to the 
mother’s pouch immediately after birth (cf . 4 ) . 

On the basis of his investigations on Amblystoma and other forms, 
Coghill proposed certain generalizations regarding the sequence of 
behavior development. Foremost among these is the statement that 



Fig. 39. Successive Stages in the Development of Swimming Movements 
in the Larva i'l sto/rm. (f ioir. Coghill, 7, pp. 7, 8.) 


behavior develops by iiuiividmmon from a total pattc»*n into pro- 
grcssivelv smaller units. This is viiluaily the reverse of the view that 
the earliest acts arc simple reiicxes through whose combination and 
integration complex behavior develops. From his observations, Coghih 
maintained that movements of tlie whole triiiT precede movements of 
the limbs, the latter being in turn follcmed by movements of the 
fingers. Thus, he found “that tlic hist limb movement is an integral 
part of the total reaction of the animal, and that it is only later that 
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the limb acquires an individuality of its own in behavior” (7, p. 19). 
He concludes from such findings that: “Behavior develops from the 
beginning through the progressive expansion of a perfectly integrated 
total pattern and the individuation within it of paitial patterns which 
acquire various degrees of discreteness” (7, p. 38). Two additional, 
related generalizations arc that development proceeds along cephalo- 
caudal and proxhnodisial axes. The former is based upon the finding 
that movements of the head region generally appear at an earlier age 
than movements of the rest of the bod\, and the progression tends to 
be from head end to tail end. The latter refers to the succession of 
development from the trunk outwards; the farther a part is from the 
trunk, the later, in general, vill it exhibit independent movement. 

An example of the application of these generalizations to a much 
higher animal form is furnislicd by studies of behavioral development 
in the fetal" cat by Coronios (8) and his collaborators. The precise 
age in days when a large number of diflcrcnt reactions first appeared 
was noted, together with their subsequent dc\elopmcnt or disappear- 
ance. I’or example, crawling first oecuned in the cat Ictus on die 53rd 
day follov^ing fcitilizatiori, swallowing on the 5Kt da>; the character- 
istic “righting reaction" of the cat on the 47th day; and longue protru- 
sion on the 30th day. Unilateral head bending was first observed on 
the 23rd day. and the Coghillian C-reaction was found to occur from 
the 31st to the 45th day of fertilization age. Uoronios found evidence 
of both a cephalocaudal and a proximodistal progrcs>ion in the be- 
havior development of the fetal cat. His ^inding^ also corroborated 
CoghilTs individuation theory, the earlier reactions being relatively 
dill use, unorganized movements of the entire organism, and progress- 
ing by regular stages to more precise, well-coordinated responses 
within a narrowly circumscribed area. 

Among the other conclusions reached by Coronios on the basis of 
these studies, special interest attaches to his statements that: “Before 
birth there is a rapid, progressive, and continuous development of 
behavior in the fetus of the cat. ... The ‘primitive' reactions of 
breathing, righting, locomotion, and feeding arc the products of a long 

In refeninji to punatil development, the teims '"^eiminalp "enibrvomcP .inti 
'*feiar are applied to sueecssivc stages Tn the human, loi example, the germinal stage 
lasts for apprnxim.itel\ two weeks aitci fcitilization; tiom that lime until the age of 
two months the organism is known as an embryo, and fiom two months imtil birth, 
as a fetus. The duiation ol these stages, ol coiiise, vanes with different species. 
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and continuously progressive course of prenatal development” (8, pp. 
377-378). A consideration of the extent to which behavior develop- 
ment occurs before birth, as illustrated by these observations of Coro- 
nios, is important for the proper evaluation of some of the observations 
made on postnatal behavior development. For example, in earlier 
studies by Tilney and his associates (50) on the behavior development 
of the cat after birth, emphasis was placed upon the “sudden emer- 
gence” of a number of reactions at specific postnatal ages. Several 
writers have cited these results as strong support for the role of matu- 
ration in behavior development. All that such observations may show, 
hrjwevcr, is the sudden reappearance, in the presence of the suitable 
environmental stimulus, of behavior which has undergone a more grad- 
ual development during the long prenatal period. 

'I hc role of prenatal environment in the development of behavior 
is stressed by Kuo (28, 29, 30, 31 ) on the basis of his extensive and 
carefully controlled studies of the chick embryo. The observation of 
the first appearance of dilTcrent responses tended again to corroborate 
Coghilfs hypothesis that development of behavior follows cephalo- 
caudal and proximodistal progression. Kuo reports that “head move- 
ments appear first, trunk movements next, and those of the extremities 
and tail last” (28, p. 406). 

Besides charting the order of appcarcncc of dilTercrit reactions, Kuo 
investigated the possible contributions which mechanical and other 
environmental factors operating during prenatal life might make to be- 
havior development. He no :d, for example, that the beating of the 
heart early in prenatal development produces a general rhythmic vibra- 
tion of the inert fetal body, which sta ts the head on passive mechani- 
cal movement. Similarly, the median! :al movement of the fetus by the 
contractions of the amnion stimulates the fetus to make active move- 
ments. A marked increase in active fetal movements is found at the 
period of greatest amnion activily. Fye reflexes were likewise observed 
before birth, despite the absence of visual stimuli; such reflexes oc- 
curred in conjunction with movements of the body. From observations 
such as these, Kuo concludes that every part of the muscular response 
system of the chick has been exercised before birth, and that many 
organs begin to function when still in a rudimentary form. He lays 
considerable emphasis upon the fact that the development of behavior 

^ The scic in which the fetus is enclosed. 
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is gradual and continuous, pointing out that so-called instinctive pat- 
terns of behavior do not appear suddenly, but have a long develop- 
mental history. 

Our knowledge of prenatal behavior has been considerably ad- 
vanced by the thorough and detailed studies of the fetal guinea pig 
conducted by Carmichael and his co-workers (4, 5). The time of first 
occurrence as well as the nature of the many responses in this animafs 
prenatal behavior repertory was determined through carefully con- 
trolled studies. In addition, a number of more specific investigations 
dealt with such problems as the response to temperature and to pres- 
sure stimulaiion of varying intensity, fhe latter studies indicated the 
influence of stimulus intensity upon the nature of the response, a light 
pressure on a particular point of the skin, for example, eliciting only 
an eye wink, wliile a more intense pressure on the same j)oint led to 




Fig. 40. Contra>ling Reactions ol the Fetal Cimnea Pig to Fight and 
Heavy Pressiiie Stimulation. (From C’armichael and Smith, 6, p. 432.) 

movements involving head, entire trunk, and all four limbs. These two 
responses are shown in Figure 40. U should be noted in connection 
with CoghiU’s generalizations that the more intense stimulation tended 
to evoke a more general response, while weaker stimuli called ft^rlh 
more specilically localized reactions, regardless o/ the age oj the fetus. 

Carmichaefs results on the whole suggest the need for qualifying 
any generalizations that had previously been proposed regarding the 
course of behavior development. ITus he found that the earliest be- 
havior of the guinea pig includes some responses which arc “general- 
ized" in Coghilfs sense, but also some which arc highly specific and 
narrowly localized. Carmichael suggests that many of the early re- 
sponses which appear to be general may actually be scries of responses 
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to successive proprioceptor stimulation. Sucli stimulation, spreading 
through the organism, may lead to responses which may easily be mis- 
taken for diffuse and iindifTcrentialed behavior. C'armichaers studies 
also revealed a number of exceptions to the cephalocaudal and proxi- 
modistal progression of behavior development, indicating that these 
generalizations, too, need to be qiiahiied. 

PRENATAL BEHAVIOR DLVEIOPMENT 
IN HUMAN SUILIEC IS 

The major studies of human fetal behavior reported to date arc those 
by Minkowski (34 j, Boiaflio and Artom (2), Hooker (25, 26), and 
Sontag (45, 46). In f looker’s investigations, moving-pictuie records 
supplemented the steivigraphic notes of the earlier workers, thus mak- 
ing possible later, detailed analyses ol responses. Hooker’s Frcliminary 
Atlas oj Early Human Fetal Activity (25), based upon these photo- 
graphic records, is a valuable source of data on human fetal behavior. 

The research on fetal behavior being currently conducted by Sontag 
and his associates at the Fels Research Institute, as a part of the de- 
velopmental studies of the Institute, has already furnished promising 
results on a number of questions. 

The available evidence indicates that all receptors arc probably 
capable ol lunctioning in the fetus, ahhough the conditions of prenatal 
life are such that vision, taste, smell, and temperature are not likely 
to be stimulated, Since cha'’-:e i' a primary requisite for stimulation, 
the uniformity of llie prenatal environment in such characteristics as 
temperature and chemical composition makes the activation of certain 
receptors unlikely. Responses to aiK'itory stimulation in utcro have, 
however, been noted. Touch stimulation has been extensively studied 
and lotind to occur shortly after the eighth week. Although a general 
cephalocaudal and proximodislai succession may be observed in the 
development of such sensitivity, exceptions have again been noted. 
Proprioceptors probably begin to function at the time when the first 
active rnovernenis are made, when the ictus is about two months of 
age. The importance ol such stimulation in producing what appear to 
be generalized movements of ilic entire organism is again emphasized 
by Carmichael, in his survey ol the available data on fetal behavif^r. 

‘‘The jMopnoccpi'iis are located in the muscles, tendons, and joints, and furnish 
stimulation to the organism iioin its own movements. 
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Inreferencetoorganicsensesinthefetus, Carmichael writes: . . it 

may be said that there are possibly certain organic changes in the 
stomach, intestines, heart, and vascular and respiratory systems which 
occur before birth and which may be important in receptor stimula- 
tion” (4, p. 131 ). 

Thus it is apparent that the fetus has various sources of stimulation 
in its normal prenatal environment. Stimulation may arise from inter- 
nal changes, as well as from tactual and proprioceptive impulses re- 
sulting from movements of the Ictus itself. The “spontaneous” move- 
ments noted in the fetus probably represent responses to such stimuli, 
unrecognized by the observer. Mention may also be made (T the possi- 
bility that the passive movement of the fetus by the contractions of the 
amnion, occurring after the third month, may in turn stimulate active 
motor responses, as was suggested by Kuo for the chick embryo (cf. 4, 
p. 104). 

Following the first two months of prenatal life, a variety of motor 
responses have been observed. While many of these respon'^cs tend to 
involve the entire organism, a number of local reflexes also appear 
early in the course of development. By the fourth month, nc^arly all 
the reflexes of the newborn can be elicited. It is interesting to note that 
a number of reactions whose proper stimuli are not present until after 
birth are nevertheless found during prenatal life. Thus crying, suck- 
ing, and eye reflexes, for example, have been observed in the fetus. 
Among the early movements of the fetus arc rhythmic contractions of 
the chest and thorax, similar to breathing movements, which can be 
recorded directly through the mother’s body. Since the fetus is sus- 
pended in a liquid, true lung breathing is, of course, impos'sibic; but 
the neuromuscular mechanism of breathing may be exercised and 
strengthened by these movements (cf. 4, p. 104). 

A number of fettil responses appear also to be the precursors of 
later postural and locomotor behavior of the infant. Reflex “balanc- 
ing” responses, h)r example, have been observed in the fifth month of 
prenatal life. At this time, movements of the head in space lead to 
eqiiilibratory movements of the limbs. Passive movement of the fetus 
produces active reactions which return the fetus to its original position. 
Stimulation of one foot in a five-month fetus may lead to the bending 
of the corresponding leg and the extension of the opposite one; a 
diagonal response of the opposite hand may also result. This “trot” 
reflex may underlie such postnatal activities as crawling and walking. 
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An increase in the amount and rate of fetal movement has been 
observed during emotional excitement of the mother (45) . Apparently 
the physiological changes occurring in the mother’s body during emo- 
tional stress excite the fetus to greater activity. There is also some 
evidence that hyperactivity, excitability, and feeding difficulties may 
occur for several months in infants whose mothers have undergone 
severe emotional strain during pregnancy ('45) . Tlie physiological con- 
ditions of prenatal life associated with the mother’s emotional state 
may thus have effects which continue in postnatal life. 

Can learning occur during prenatal life? The question is a contro- 
versial one, but there is evidence that at least simple modifications of 
behavior do occur following stimulation. Adaptation to a vibratory 
stimulus applied to the mother’s abdomen has, for example, been ob- 
served in the fetus. Ordinarily such a stimulus will evoke a typical 
“startle reflex” involving sharp, convulsive movements and a sudden 
rise in heartbeat (45, 46). A definite decline in this startle response 
was noted in a fetus repeatedly stimulated in this manner over several 
weeks (45). Rudimentary conditioning has also been obtained in 
human fetuses in iitero during the last two gestation months 
In these experiments, a loud noise served as the unconditioned stimu- 
lus, and a vibratory stimulus applied to the mother’s abdomen as the 
conditioned stimulus. Approximately 15 to 20 paired stimulations 
were required to establish the conditioned reaction to the point at 
which 3 or 4 successive responses to the vibratory stimulus alone were 
obtained, while additional reinforcements led to as many as 1 1 suc- 
cessive conditioned reactions. Experimental extinction, spontaneous 
recovery, and retention of the response over a three- v\ cek interval were 
also demonstrated. Several investigate rs (cf. 35, p. 209) have succeeded 
in establishing conditioned reactions in the neonate to a variety of 
stimuli. The suggestion that some of the responses of the fetus and the 
neonate may be conditioned reactions has been made by Holt (24) on 
theoretical grounds. 

BEHAVIOR DEVELOPMENT HUMAN INFANTS 

Mention has already been made (cf. Ch. 2) of the detailed normative 
scales which chart the normal course of postnatal behavior develop- 

may also be noted that Gos (17) reports conditioning of the chick before 
it has emerged from the egg. 
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ment in the human infant and young child. The investigations of Gesell 
( 13, 14, 15, 16) and his co-workers at the Yale University Clinic of 
Child Development are probably the most cxtcnsi\c and well con- 
trolled of such normative developmental studies. Since the behavior 
reperiory of the child during the first few years consists so largely of 
simple sensory and motor functions, most of the data concern this type 
of activity. 

Gesell concludes from various findings that the behavior develop- 
ment of the infant and child depends primarily upon ‘"growth” or 
“maturational” factors, rather than upon learning. As one source of 
evidence, he cites observations c»n pre-tenn and post-term babies. In- 
fants born one month before the nornial nine-month gestation period 
do not reach the developmental level of the normal newborn child 
until the age of one month. Similarly, a baby born after a ten-month 
gestation period will be as far advanced in its behavior at birth as a 
normal one-month-old child. Yet, Gesell points out, there is a vast 
difference between the prenatal and postnatal environments in the 
opportunity lor learning and for the specific excicisc of behavior 
liinctions. 

Gesell likewise points to the amsistau v of de\ clopmcntal seeptetn es 
as e\idence for maturation. In the development of prehension be- 
havior, for example, the successive stages follow in the same order 
and at appioximatcly the same ages in different children. Thus the 
child’s reactions towarvl a small sugar pellet placed in front of him 
show^ a charactcri^^tic chronological sequence in visual fixation and in 
hand and finger moA ements. Use of the entire hand in crude attempts 
at ‘"palmar prehension.” lor example, occurs at an earlier age than the 
use of the thumb in opposition to the palm; this is in turn followed by 
the use of the thumb and index finger in a more efficient “pinccr-like” 
grasp of the pellet. Such sequential patterning is likewise reported for 
walking, stair climbing, and most of the sensori-motor development of 
the first few years. 

A similar emphasis upon maturation is to be found in other studies 
of behavior development in infancy (cf. 1,9, 1 1, 21, 33). Shirley (41, 
42), in a study based upon weekly and biweekly examinations of 25 
infants from birth to two years, concludes that “motor control in 
infancy begins headward and travels footward,” thus supporting the 
principle of cephalocaudal sequence. In a summary of her observa- 
tions, she writes: 
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The first stage of the motor sequence was eye-coordination, which had 
as sub-stages furtive pursuit movements, fixation of persons and objects, 
and following moving objects. During the same interval postural control 
moved down the body from head-turning and head-lifting to chest-lifting 
in the prone position. Likewise there was gradually less need for support 
of head, neck, and back when the baby w'as held on the shoulder. The sec- 
ond period saw advancement in postural control and development of 
reaching. When the baby was seated on the evaminer’s lap her hands gave 
him support first at arm-pits, then at mid-ribs, abdomen, and finally none 
save the cupping of her lap. Reaching progressed from random waving 
toward the object through touching, grasping, retaining with thumb oppo- 
sition, to the goal of canying object to mouth. With the advent of sitting 
aljne motor control had migrated down to the sacral region, and with the 
use of the index finger for pointing, which occurred about the same age, 
it had crept out to the finger tip (41, p. 204). 

Shirley’s strong leanings toward a hereditary interpretation are ex- 
emplified in the following quotation: 

Can an order which holds so universallv be attributed to training or 
even to spontaneous practice? Does not its conformity to the anterior- 
posterior growth law make the motor sequence a normal unfolding fsic! | 
ot developmental pioccsscs — in other words, ii I unction ol maturation? 
(41, p. 205). 

In evaluating the conclusions reached by such investigators, the f(fi- 
lovving points ought to be borne in mind. First, the possibilities for 
prenatal exercise of various functions as well as for prenatal learning, 
suggested in tiie preceding .seussion, should be noted. Dennis (10) 
has argued against such a view. Pointing out that neither operant con- 
ditieming nor trial-and-error lc:n juig has been demonstrated in 
the luirnan fetus or neonate, he concludes that most of the behavior 
of the infant ai birth is indearncd and that fetal development is almost 
entirely a matter of maturation. The question is certainly not yet 
settled, however, and requires nioie than the relatively meager infor 
mation now^ available on human prenatal development. It should be 
noted, furthermore, that ihe simple exercise of various functions, 
initiated and determined by the physical conditions of the prenatal 
environment, might still infinence subsequent behavior development, 

'^Operani conditiomrn? involves cfiiiciusoninp, of landoni, spontaneous, oi 
self-initiated acts, and is tiins the coiintcip.n' ni condiljoned-icsponsc terminoUygy, 
of tnafand-ci roi leaniiiip. 1 or a uioic UxhniCdl dibcussioii of liic concepts of operant 
and resiiondent behavior, cL Sknuici (44 >. 
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even if operant conditioning and trial-and-error activity are shown to 
be impossible in the fetus. 

In the second place, the importance of guarding against easy gen- 
eralizations must again be emphasized. Superficially, infant behavior 
may appear to fit the cephalocaudal and proximodistal sequences, but, 
as in the case of infrahuman and prenatal development, exceptions 
can be found. 

Thirdly, as applied to certain forms of behavior, these generaliza- 
tions seem to be the result of the physical exigencies of the situation 
rather than the result of a developmental progression in the locus of 
motor control. Let us consider, for example, the observation that the 
baby, in order to sit up, requires support first at the armpits, then at 
mid-ribs, then at the abdomen, and finally none besides the cupping 
of the holder’s lap. Now, if the child is supported at the abdomen, his 
own muscles have to support the upper half of the body. If, however, 
he is supported at the armpits, the intervening parts of the body arc 
thereby supported as well. Thus the “armpit support” is physically 
more complete than the “abdominal support.” I'he former would nat- 
urally be required with younger children whose motor control is still 
inadequate. The same may be said of the fact that sitting up f^rccedes 
standing. The child has less to support through his own muscular 
efforts when seated than when standing. Similarly, it could be argued 
that finger movements, in order’ to be used effectively, require more 
delicate and finer adjustments than do the grosser movements of hands 
and arms, and may appear later simply because they require more 
highly developed muscular control. Certainly no special law of “inner 
growth” is needed to account for the fact that movements requiring 
the coordination of more muscle groups or those requiring finer coor- 
dination appear later than movements requiring fewer muscles, less 
strength, or less delicate coordination. Learning ordinarily progresses 
from the simpler, easier aspects of a task to the more complex, and 
would thus be completely consistent wath the type of progression ob- 
served in these developmental studies. 

A fourth point to be noted is that uniformities of developmental 
sequences may result in part from certain basic environmental similari- 
ties in the average American homes in which the children have been 
developing. Not only deliberate instruction on the part of adults, but 
also other, unplanned uniformities in the child’s physical and psycho- 
logical milieu need to be considered. Under these conditions, a regu- 
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larity of sequence need not necessarily imply maturation. Finally, it 
should be remembered that the behavior under investigation, being 
largely of a simple sensori-motor nature, would naturally depend upon 
the level of structural maturity attained at the time. It would be unwar- 
ranted to generalize from such observations to the development of 
more complex, symbolical activities in the older individual. 

STRUCTURAL CORRELATES OF BEHAVIOR DEVELOPMENT 

Any contribution which hereditary factors may make to behavior de- 
velopment must obviously operate through structural characteristics, 
which would in turn set certain limiting conditions to the acquisition 
of behavior. As was pointed out in Chapter 4. this is the only way in 
which such hereditary influences may be manifested. To study what 
structural changes occur concomitantly with the observed develop- 
mental changes in behavior thus constitutes another approach to the 
heredity-environment problem. 

Coghill (7) was the first to trace systematically the role of specific 
anatomical relationships within the nervous system in the development 
of a particular behavior function. In his studies of the salamander 
larva, Coghill noted that while the animal is still in a non-motile stage, 
there are both sensory and motor nerve fibers in contact with receptors 
and muscles, respectively, but there is no central connection between 
the two. At this stage, the animal does not respond to tactual or 
chemical stimulation of lh<‘ skin, however intense. Regarding subse- 
quent devclofwncnl, Coghill writes: 

With the ability to lespond to tactile >r chemical stimulation of the skin 
there appears a third senes of ceils. Tli,y biidgc the g.ip between the sen- 
sory system of one side and the motor system of the other. Their bodies 
lie in the floor plate of the medulla oblongata and upper part of the 
spinal cord. In the non-motile stage these cells are unipolar. The one pole 
of the cells extends either to the right or to the 'eft into close relation with 
the motor tract on one side only. When they become bipolar, they com- 
plete the path from the sensory field to .oe muscle ... (7, pp. 12-13). 

The presence of these bipolar cells is sufficient for the development 
of muscular responses through the “coil” stage described in the pre- 
ceding section. Additional neural structures must appear, however, 
before swimming movements can occur. In summarizing this stage, 
Coghill writes: 
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At the time swimming begins there is a growth of collaterals from fibers 
of the anierior part of the motor tract into relation with the dendrites of 
floor plate cells. These collaterals cause an excitation that is on its way 
to the muscles of one side to be carried through the commissural cells of 
the floor plate to the motor system of the other side. But in this passage 
to the muscles of the opposite side more synapses are involved than there 
arc in the path to the muscles of the same side; so that the second flexure 
follows the first by a very brief interval. In the same manner as the impulse 
for the first flexure excited the second flexurL so the impulse for the 
second excites the third, and so on (7, p. 14). 

Thus wc sec how the delicately timed succession of excitations from 
side to side could serve as a basis for the overlapping right and left 
C-turns of the body which constitute the swimming movement ot this 
organism. Coghill charted similar relationships between the anatomical 
growth of the nervous system and the appearance of other behavior 
functions, such as feeding behavior. 

A similar, but usually much less complete, neural basis has been 
worked out lor the earliest behavior development in a number of other 
animal forms, such as the rat. Through anatomical and histological 
studies, the development of sensory structures has likewise been re- 
lated to the appearance of corresponding behavior functions. Some 
information regarding the neural structures underlying early human 
behavior development has been obtained from histological studies of 
human fetuses after death. Observations of non-motile fetuses, for 
example, have shown that in such cases sensori-motor connections had 
not yet been established in the nervous system. 

All the investigators whose observations of human fetal behavior 
were cited in the preceding section have also condiieled experiments 
designed to determine the functioning of dilTerent levels of the nervous 
system at various prenatal stages. EAtirpation of the cerebral hcjui- 
spheres has been found to have no elTect on the motor responses of 
the early fetus. Until about the end of the third month, the cerebral 
cortex appears not to function in behavior. Hven after three months (T 
fetal age, the eflcct of cortical removal is slight and not observable in 
all cases. The only observed effect of decortication at any time during 
prenatal life is an increase in the intensity of local reflcxei following 
cortical removal, the cortex having presumably exerted an inhibitory 
influence. Sectioning of the spinal cord in a motile fetus, on the other 
hand, abolishes certain reflexes in the corresponding body areas. Re- 
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moval of the cord leads to complete cessation of scnsori-motor 
responses. 

Electrical stimulation oj the cortex generally produces no response 
in the human fetus, although such stimulation applied to lower brain 
centers or to the spinal cord produces the appropriate muscular con- 
tractions. In one investigation (2), some evidence was presented that 
stimulation of the cortex during the last prenatal month arouses activ- 
ity. Studies of the electru al brain potentials of the human fetus have 
recently been made by means of electroencephalograms obtained in 
iitero (32). These als(^ show little or no evidence of brain activity 
brforc birth. It is generally believed, from a variety of evidence, that 
cortical control of behavior in the human does noi begin until some 
time after birth. 

A study of the bract potentials of the fetal guinea pig (2“^), how- 
ever, gave evidence of elcelrical activity beginning at about two weeks 
before normal birth time. In slightly older fetuses, the effects of stimu- 
lation could be rccogni/ed by definite, though not invariable, changes 
in the electroencephalogram. The authc'rs citneluded that: ''The guinea 
pig brain first exhibits electrical activity at a time when behavioral 
indications also point to maturation of highei nervous centers’' (27, 
p. 71 ). 

Another attempted apprcnich to the identification of the structural 
correlates of behavior was based u|)on the tuyelination of the nerve 
fibers. Before impulses can be conducted along discrete paths from 
receptor to nervous system md from ihere to the muscles, the nerve 
fibers must be insulated. It was suggested that the myelin sheath which 
surrounds the nerve fiber provides su. h insulation and that the appear- 
ance of the myelin covering would ^hus serve as ,an indicator of the 
time when different nerve paths begin to function. Tilney and Casa- 
niajor (49) applied this method to their studies of behavior develop- 
ment in the newborn cat and guinea pig. Although these investigators 
reported that the specific reactions which they observed did not appear 
until the corresponding nerve fiber" had become myelinated, these 
findings were nut corroborated by oiiier workers with the same or 
other animal forms. It is now generally recognized that myelination is 
not a prerequisite for the functioning of nerve fibers. 

Several other factors have been proposed as essential antecedents 
for sensori-motor functioning. It has been suggested, for example, 
that the real carriers of the nerve impulse are the nemo fibrils, tiny 
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threadlike processes which run through nerve tissue. If this is the case, 
then the onset of functioning of particular nerve paths could be deter- 
mined by observing the first appearance of the neurofibrils. Some 
investigators have noted relationships between the presence of the 
neurofibrils and behavior development, but the role of the neurofibrils 
is still insufficiently established for the proper interpretation of these 
findings. Since chemical changes occur during neural functioning, the 
presence of certain chemical substances has been regarded by some as 
a possible antecedent of such functioning. In his studies on the chick 
embryo, Kuo (31), for example, found that the first true ncurally 
mediated responses do not occur until after the presence of acetylcho- 
line can be detected. 

This brief examination of typical procedures and findings should 
serve to characterize the present state of knowledge regarding the 
structural correlates of behavior development. The clear-cut identifi- 
cation of structural changes underlying the appearance and develop- 
ment of particular behavior functions is rare. Conclusive results have 
been largely restricted to simple behavior in relatively low animal 
forms. Much of the available information, especially as it pertains to 
the human, is highly tentative, exploratory, and sketchy. Certainly the 
available data furnish no justification for many of the statements fre- 
quently made regarding the neurxil basis of complex behavior charac- 
teristics in man. Such speculations may possibly be of some value in 
suggesting problems for research. But it is important that their specu- 
lative nature be clearly recognized. 
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Psychological Factors in Simple 
Behavior BeveJopmeni 

The present chapter is concerned with studies specifically designed 
to determine the variations in behavior development resulting from 
changes in environmental conditions. The investigations to be con- 
sidered are similar to those treated in the preceding chapter in that 
they deal with relatively simple behavior. Moreover, the organisms 
studied are themselves either at a relatively low level in the phyletic 
scale or are studied at an early stage in their development. In the 
majority of these investigations, environmental conditions were system- 
atically varied by the e:Kpcrimenter and the corresponding changes in 
behavior were observed. This is true of all the studies on injmihunuin 
subjects, to be discussed in the first scclion. The second section deals 
with investigations of human infants by the method of co-twin control. 
In such a method, environmcntalconditions are altered by experimen- 
tally providing additional training for one member of each pair of 
twins. In the third section of the present chapter we shall consider the 
method of experimentally restricting the training of human infants, a 
method of which relatively little use has been made.^ 

Although not, strictly speaking, experimental, the two remaining 
groups of studies covered in the present chapter are closely allied to 
the methods described above. The observation of the behavioral effects 
of infant-rearing practices in different cultures, as well as the case 
reports of "'feral man'* which have appeared from time to time, may 
be regarded as relatively crude, unplanned “experiments” on the 
effects of environmental variation. 

^ The three methods discussed above logically fall under category' 4 in our listing 
of the approaches to the problem of heredity anci environment, given in Chapter 4. 
For surveys of such studies, cf. 7, 32. The other two types of study to be considered 
in me picscnt chapter fall under categories 5 and lespectively. 

164 
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EXPERIMENTALLY PRODUCED CHANGES 

IN ANIMAL BEHAVIOR 

A number of behavior functions commonly regarded as unlearned or 
“instinctive” have been subjected to experimental control, whereby 
the animal was prevented from exercising the function until well past 
the age when such a function normally appears in the species. By such 
isolation of factors, the attempt is made to determine the extent to 
which the physical rnaturaiion of the necessary structures will in itself 
lead to the performance of the given function. Other experimenters 
have followed the opposite procedure of providing additional intensive 
training in order io determine how tar normal behavior development 
can be accelerated. 

One ol the pioneer experiments on maturation vcisiis learning was 
that of Shepard and Breed (34) on the pecking behavior of chicks. 
One of these authors (Breed, 3) had previously observed that the 
pecking response invoked three separate reactions — striking, seizing, 
and swallowing — in each of which an error could be made. Defining a 
successful response a^ one in wdiich all three processes were com- 
pleted, he lound that, during the first month alter hatching, the accu- 
racy of chicks in peeking at grams normally rises from about 15^^^ suc- 
cessful trials to about S49^. Is such improvement the result of the 
intervening practice or does it follow from the structural development 
of the sensory, motor, and neural mechanism? To answer this c|uestion, 
Shepard and Breed (34) kept three groups of newly hatched chicks in 
darkness lor periods of thr^ , four, anc^ five days, respectively. During 
this time the chicks wx-re fed by hand, the experimenter conveying food 
and water directly into their moutl >. Because of the darkness, the 
chicks had no opportunity to peck at any objects. In 'erms of the 
above-mentioned criterion ol successful responses, the older chicks — 
whose pecking had been delayed longer — did no better initiallv than 
those which had started to peck at a younger age. The older chicks, 
however, progressed more rapidly. In other W('rds. the groups that had 
been delayed for several days requin •! fewer days of practice to reach 
the maximum level of accuracy than did the chicks which had had 
unrestricted opportunity to peck from birth. The chicks which had 
reached a more advanced stage of physical development thus profited 
more from practice than did the younger chicks. These results seem 
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to suggest that, although some practice is necessary, the pecking re- 
sponse also depends upon maturation. 

Although widely quoted, the results of this early experiment were 
inconclusive for several reasons, not the least of which were the small 
number of cases and the wide variability of performance among indi- 
vidual chicks. More recently, the experiment has been repeated by 
several investigators, with certain methodological modifications. The 
most carefully controlled and thorough analysis is to be found in the 
investigation by Cruze (10)- A total of 202 chicks from comparable 
stock were separated into eight experimental groups, each containing 
25 or 26 chicks. The first five groups, A to E, were kept in darkness 
for 1, 2, 3, 4, and 5 days, respectively. This part of the experiment 
thus represents a repetition of vShepard and Breed's investigation, 
under more carefully controlled conditions. 

The results were also similar to those of the earlier investigators, 
as showai in Figure 41. It will be noted that, although each group is 
about equally poor in initial performance and all show marked im- 
provement with practice, iliis improvement is more rapid fnr the (fider 
chirks. Thus the final level of 20 out of 25 successful trials was reached 
after 15 days of practice by (iroup y\, whicli began to peck after only 
one day of confinement in the dark, i.c.. when the chicks were only 
two days old. At the other extreme. Group E., confined for 5 days, 
reached the same level of accuracy after only 7 days of practice. 

Cruze did not, however, stop at this point, but set out to control one 
more factor, viz., the amomu oi (laily ptai tire subsequent to removal 
from the dark room. Ciroups A to E had been free lo peck, and did 
in fact peck extensively, outside of the daily test periods. In other 
words, although practice had been prevented during the period of 
dark -confinement, no control was exerted over the amount of subse - 
quent practice. Three additional groups, F, G, and H, were kept in 
darkness at all times except during the daily tests. a\I 1 three groups 
started (he pecking tests after one initial day in darkness and are 
therefore comparable in this respect lo Group A. The results obtained 
with each of these four gnuips are given in Figure 42. Group A, with 
25 trials a day plus unlimited outside practice, quickly oulslrippcd the 
other three groups in performance. The practice undergone by 
Group F was restricted to the 25 daily test trials, and its progress is 
correspondingly slower. Group G, allowed only 12 daily test trials, is 




Fig. 41, Pecking Pcrtormance of I'ive Groups of Chicks Prevented from 
Pecking foi 1. 2, 3, 4, and 5 Days, Respectively. (Froin Cruzc, 10, p. 386.) 
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clearly at the bottom of the groups. In fact, this group shows an 
almost horizontal curve, with only slight improvement during the first 
lew days. An interesting variation was introduced in Group H, which 
had 12 daily test trials for 10 days, followed by 25 daily test trials 
during the next 10 days. The sharp rise in the curve of this group after 
the tenth day clearly reflects this change in amount ot practice. During 
the first 10 days, progress in this group had been very slow, the curve 
coinciding closely with that of Group G. 

Jt is apparent from this part of the experiment that extensive prac- 
tice is essential for reaching a high level of attainment in the pecking 
response. Only a slight initial improvement could be directly attributed 
to maturation. Further analysis of the nature of the errors, moieover. 
demonstrated that the role oi maturation is restricted primarily to the 
development of the striking response. The swallowing lesponse is a 
reflex which is present bclore hatching and shows little change in any 
of the groups. The accuracy of the seizing response was found to 
depend closely upon practice; this aspect of the pecking function evi- 
dently accounts in large part for the eflects of practice noted in the 
case of the total function. 

In a comparison of the present investigation with the earlTer work 
of vShepard and Breed, two points are noteworthy. First, with more 
refined methodology, the important role ol learning becomes apparent 
in the performance of a function which superficially appeared to 
depend more largely upon maturation. Secondly, the analysis of a 
complex function may show some of its component activities to be 
largely the result of learning, others largely the result of maturation. 
Any attempt to characterize the function as a whole in terms of learn- 
ing or maturation would thus be misleading. 

In a series of investigations by Carmichael (4, 5, 6), the role of 
maturation and learning in the swimming of tadpoles was investigated 
by a similar method. While the newly hatched animals were still in 
a non-motile stage, they were separated into two groups: one of 
these, the control group, was allowed to develop in ordinary tap water; 
the other group was kept in a weak solution of chloretonc. Although 
not interfering with neuromuscular growth, the chloretonc produced 
complete immobility, thus making practice impossible. When the 
control group was swimming normally, the drugged tadpoles (who had 
reached the same stage of structural development) were removed to 
fresh water. Within 30 minutes, these tadpoles are reported to have 
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been swimming as well as the control group, which had had five days 
of swimming practice. 

In a further experiment, Carmichael (5) demonstrated that the 
30-minute delay represented simply the time required for the effects 
of the drug to wear off, and could not be regarded as a period during 
which rapid learning might have occurred. In still another experiment 
(6), in which isolation from external stimulation was employed in- 
stead of anaesthetization, essentially the same results were obtained. 
From these investigations, Carmichael concluded that the swimming 
of tadpoles can be classified as unlearned behavior, depending solely 
upon structural development. 

More recently, Fromme (17) repeated this investigation with a 
number oi modifications. Chloretone was again used to induce immo- 
bility, the drugged and control tadpoles being carefully equated m 
terms of their precise stage of morphological development. A quanti- 
tative measure of the velocity of swimming was obtained by testing the 
animals in a narrow trough wliich permitted only straight-line swim- 
ming. With this measure, the investigator was able to show that, al- 
though all the animals were able to swim shortly after rcmowil from 
the drugged water, significant differences in speed and distance of 
swimming were present between the drugged and control groups. 

Moreover, the developmental ,stage at which the animals w'cre re- 
leased from the drug influenced the results. Thus one group, released 
at the stage when the first movement normally occurs, showed no 
difference in swimming ability from the control group. This is to be 
expected, of course, if the only efiect of the drug is to produce immo- 
bility. A second group was allowed to develop in ordinary waiter until 
the appearance of the elementary movements which normally pre- 
cede swimming. Tlien it was placed in the drugged water, where it 
remained until it reached the stage of structural development charac- 
teristic of the free-swimming tadpoles. When released and tested at 
this time, the group swam more poorly than the control group.- 
Finally, a group anaesthetized throughout its development and re- 
leased and tested at the stage when swimming normally occurs showed 
an even greater inferiority, when compared with the control group. * 

7’he (lifTerencc between e\pcrinient:il and eonliol groups was statistically s.gnili- 
cant, the critical ratios being 5.2 for speed of swimming and 5.3 loi distance, (bor an 
explanation of statistical tests of significance.^cl Ch. IS). 

The critical ratios of these differences were 9.5 and 7.1 foi speed and distance 
measures, respectively. 
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From all these results, Fromme concludes that, “although the develop- 
ment of structure may be explained completely in terms of the growth 
process, the development of behavior is determined only in part by 
the structures produced by growth and is affected by its own be- 
havioral antecedents as welT’ ( 17, p. 235)/ 

Similar experiments have been conducted on such activities as the 
flying and singing of various species of birds, and the behavior of cats 
in catching and killing mice. The role of maturation and learning in 
the development of sexual behavior has also been investigated in a 
numbci of animal forms. In general, such studies have indicated that 
s-mie sexually deterinined activity occurs when certain developmental 
stages are reached, as a result of endocrine secretions and other 
physiological factors. The specific w^ay in which such activity is ex- 
pressed and the ()bj».et toward which it is directed, however, vary 
according to environmental circumstances. 

In an experiment on male doves reared in isolation from other 
members the species, a number of sexual '"abnormalities'’ were 
observed (9). llie birds w'ould bow and coo to the experimenter as 
normal birds do to members of their own species. They seemed to pay 
especial attention to the experimenter’s hand, with which they came 
into contact when fed; one bird actually went through the act of 
copulation while on the hand taking food. Female doves reared in 
isolation developed similar anomalies of behavior (8). If the experi- 
menter stroked them and pn^cned the feathers of their head and neck, 
they exhibited characteristi cv^irting behavior. Egg-laying was even 
induced in many instances by ihis method. Experimental "homosex- 
uality” was produced in a large nu nber of cases wlien two female 
pigeons were reared together In si eh cases, the animals displayed 
the usual courting perfc-nnancc toward each other, followed by egg- 
laying on the part of both animals. 

FA|ually pronounced variations of behavior were noted in a young 
monkey separated from its mother at the age of three days and brought 
up in isolation from all members o^ the species during the first 18 
months of life (15, 16). The development of sexual behavior in gen- 

Further expei iments on clTccts of additiomd siimuUition, in which the 
ani.iials were kept in const.intly agitated water, yielded inconclusive lesults. Several 
possible reasons may account Toi this: eg., ti-e nature of Ihe stimulation employed 
may have produced othci comi'hcating cllects; or the ainoun' of pract ce obtained 
by these animals in their normal swimming may be such that additional exercise wilt 
yield rapidly diminishing leturns 
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eral was markedly delayed. During the period of isolation there was 
a minimum of the sex behavior ordinarily displayed by monkeys at 
that age. At the age of 18 months, the period of isolation w'as discon- 
tinued, and the monkey was subsequently brought up with other 
members of the species. At this time, sex behavior began to appear* 
but in a very rudimentary form. Attempts at copulation were very 
crude and trial-and-error was exhibited. Sexual activity was shown 
indiscriminately toward males and females, as well as toward mon- 
keys of other species, rags and other soft objects, and the experi- 
menter’s arm and hand. With continued association with other mem- 
bers of the species, normal sexual activity eventually developed. 
Other forms of behavior, such as feeding, play-activity, and grooming, 
were also affected by the prolonged period of isolation. 

On the whole, sexual behavior is more closely dependent upon 
physiological factors in lower mammals, and shows an increasing 
modifiability and susceptibility to experiential factors in higher forms 
(2). For example, lower mammals such as rodents can generally 
copulate successfully on the first trial. This is less true among monkeys 
and still less among the anthropoid apes. It has been obsciwed that 
some male chimpanzees have to learn to copulate, their first attempts 
being usually unsuccessful (40). Mention may also be made in this 
connection of the extensive survey conducted by Kinsey and his co- 
workers (26) by means of a carefully planned interview technique. 
This study indicated that, in the human male, sexual activity may be 
manifested in a wide variety of ways. The role of cultural factors in 
determining the occurrence of difl'erent types of sexual behavior is 
also suggested by the findings. 

A number of experiments have been conducted on the hoarding 
behavior observed in many animals (31). For example, rats which 
arc allowed access to a supply of food will carry to their home cage 
and store a much greater quantity of food than they can consume. 
Adult well-fed rats will hoard 5 to 20 pellets a day, although they eat 
only one or two each day. This characteristic behavior is probably 
associated with physiological factors, such as the sugar supply in the 
body. It has been shown to increase with low temperature and with 
food deprivation, both of which produce bodily conditions which 
may be conducive to the hoarding behavior. At the same time, it is 
interesting to note that experiential factors also seem to affect and 
modify this behavior. Thus some experimental data suggest that peri- 
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ods of food deprivation in infancy may influence adult hoarding 
behavior in rats (21, 22). 

The experiment by Kellogg and Kellogg (25), in which a young 
chimpanzee was reared for a short period in a typically human envi- 
ronment, provides a further illustration of the role of environment in 
behavioral development. The chimpanzee, a female named ‘‘Gua,” 
was isolated from its mother at die age of IVi months and brought 
up in the company of the investi- 
gators’ own son, then 10 months 
old. The association was contin- 
ued for a period of nine months. 

1 he chimpanzee was not treated 
as a pet, but as a child, and the 
two subjects were given as nearly 
identical care as possible. 

Gua was clothed in the same 
manner as the child, and showed 
no difficulty in keeping on shoes, 
stockings, and otlier common ar- 
ticles of clothing. She slept in a 
bed with the usual accessories, 
such as sheets and blankets. Up- 
right locomotion, not normally 
found in chimpanzees, was also 
acquired by this animal. Excellent 
progress was made by the ape m 
learning to eat with a spoon and 
drink out of a glass, as illustrated 
in Figure 43. She was able to 
manipulate pencil and paper to 
produce simple scribblings. Gua also learned to respond to oral lan- 
guage, and by the termination of the experimental period understood 
over fifty words or simple phrases, such as: "Blow the horn” (in the 
car); ‘'Show me your nose”; “Do yc/ want to go bye-bye?” “Take it 
out of your mouth.” The degree to which it proved possible to “hu- 
manize” the behavior of this ape is indeed suggestive, especially in 
view of the fact that the period of residence in the human environ- 
ment w as of relatively short duration and did not b'^gin at birth. 

A similar experiment was subsequently undertaken by Finch. ^ A 



Fig. 43. Gua: A Chimpanzee 
Reared for Nine Months in a Typ- 
i’^ally Human Environment. (From 
Kellogg and Kellogg, 25, p. 226.) 
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male chimpanzee, “Fin,” was removed from its mother within one 
day of birth and kept in a human environment until the age of 2 
years and 3 months. Two human siblings were present in the environ- 
ment, both older than the chimpanzee. When the ape was taken into 

the experimenter’s home, the cx- 
" ■ perimenter’s daughter was four 

years old and the son was one 
year of age. Jn general, the find- 
ings in the case of Fin were simi- 
lar to those obtained with Gua in 
respect to such everyday behavior 
as wearing clothing, eating, and 
sleeping. Figure 44 shows the 
animal at play with one of his 
human companions. Erect loco- 
motion was mastered, but the 
structural characteristics of the 
apc\s body made it easier for the 
animal to walk with the aid of 
his arms. Attempts to investigate 
responses to language met with 
failure, a finding whose interpre- 
tation is somewhat ambiguous. 
Fin's motor development was 
close to the norms for laboratory- 
reared chimpanzees, but his per- 
formance on a number of typical 
laboratory learning problems wars 
poorer than that of most laboratory apes. It is apparent that at least 
one reason for this deficiency is to be found in Fin’s poorer motiva- 
tion in such situations. His attitude toward these problems was usually 
playful; he showed little interest in the incentive, was easily distracted, 
and tried to engage the experimenter in play. Such behavior suggests 
that Fdn's superior nutritional condition as wxdl as his experiences with 
humans may have actually made him a poorer subject lor laboratory 
experiments of this sort. Motivational factors w^cre also cited among 



Chimpanzee 
Reared lor d wo Years in a Typically 
Human Environment. (Reproduced 
from unpublished matciial by cour- 
tesy of Dr. Glen I ‘inch.) 


■'"The writers ate indebled to Di. Gtcn T'liieh foi making available the unpublished 
manuscript repoiting this invcMig«itioji. 
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the possible explanations for Fin’s relatively poor showing in the 
understanding of language. 

Routine habits of eating, wearing clothing, and the like were re- 
tained well, as indicated by tests made two years after removal from 
the human home. A vigorously displayed emotional attachment to 
humans and a lack of interest in other chimpanzees also persisted 
upon return to the laboratory. Jn fact, the investigator reports that 
Fin’s friendlier and more intimate reaction to humans was his most 
conspicuous dilTercnce from the laboratory-reared chimpanzees. Re- 
sponses to novel and unusual stimuli also dilTerentiated fun from the 
other laboratory apes, the h’jrmcr showing approach and active manip- 
ulation when confronted with new objects, while the latter exhibited 
fear and withdrawal behavior. 

Till: mi:thod of co-twin control 

Training experiments on human infants by the method of co-twin con- 
trol represent essentially the same approach to the heredity-environ- 
ment question as the studies on infrahuman subjects discussed in the 
preceding section. In such experiments, one member of a pair of 
identical twins is subjected to intensive training in some activity, while 
the other is used as a control subject and prevented from exercising 
the function under investigauon. In one such experiment conducted 
by Ciesell and his co-workers (cf. 19), stair-climbing and “cube be- 
havior” (including prchens'on, manipulaliim, and constructive play 
with cubes) were studied m a pair of identical female twins, 46 
weeks old at the beginning of the exocriment. The trained twin (T) 
was put through a daily 20-minule iraining period in both types of 
acti\ity for six wrecks. At the end of this period, the control twin (C), 
wdio had had no specific training in these functions, proved equal to 
T in cube behavior. In stair-climbing, a dilTercnce was found. Whereas 
1’ was a relatively expert climber, her sister could not reach the top 
of a fivc-tread staircase even with assistance. Two weeks later, how- 
ever, still without any previous traintug, the control twin was able 
to climb to the top unassisted. At this age (53 weeks), twin C was 
herself given a two-w^eek training period, at the end of which she 
approximated T in her climbing skill. Thus, because of the higher 
level of maturational development, a two-week training period at 53 
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weeks of age proved to be nearly as effective as a six-week training 
period at 46 weeks. 

The same pair of identical twins was subsequently put through 
similar training experiments in other functions, including vocabulary 
training (20, 38). Beginning at the age of 84 weeks, twin T was given 
daily, intensive training for five weeks in naming objects, executing 
simple commissions, and other vocabulary-building techniques. Twin 
C was deprived of all opportunity to hear language during that period. 
At the end of the five-week period, when the twins were 89 weeks old, 
similar training was given to twin C for only four weeks. After this 
training, twin C had a vocabulary of 30 words. Twin Ts vocabulary 
at the end of her first four weeks of training had been 23 words, 
although at the end of the total five-week period it rose to 35 words. 
The investigators emphasize the role of maturation in these results, 
calling attention to the fact that the twin whose training was begun 
at the age of 89 weeks progressed more rapidly almost cvcr> day and 
showed a more mature manner of responding at corresponding stages 
of training than the one whose training was begun at 84 weeks. 

It should be noted in interpreting these results that, fir^t, the dif- 
ference in rate of learning betw'ccn the two twins was slight. For 
example, C’s total vocabulary after 27 days of training equaled that 
of T after 31 days of trainin^J (viz., 29 words). Secondly, in vocab- 
ulary, pronunciation, and sentence construction, the twin with five 
weeks of earlier training slightly surpassed the one with four weeks 
of later training, although this difference had largely disappeared 
three months later. Finally, it is obvious that a certain amount of 
structural development in infancy and early childhood facilitates the 
earliest stages in the acquisition of language. The child cannot produce 
combinations of sounds resembling those of adults until his auditory 
and vocal mechanisms permit a certain degree of sound differentia- 
tion and control. Thus ‘^maturational” factors might operate in this 
purely vocal aspect of language development, while the “symbolical” 
or meaningful aspects of linguistic development may well depend 
upon learning. The development of the language function may in this 
one respect be analogous to the development of the pecking response 
of chicks, as analyzed by Cruze (cf. p. 168). It will be recalled that 
Cruze found the effect of maturation to be limited to only one of the 
specific reactions which entered into the pecking behavior. 

Follow-up observations of the same pair of twins through the age 
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of 14 are reported by Gesell and Thompson (19). The authors give 
a detailed summary of the development of these twins from infancy 
to adolescence, covering physical characteristics, motor functions, 
adaptive behavior, language, and personal-social behavior. The twins 
showed a slight dilTerence in developmental rate and in motility, and 
a somewhat larger dillerence in sociability. In their interpretations, the 
authors attribute these diflercnces primarily to innate factors. They 
call attention to certain environmental dillerences — for example, a 
dilTercnce in the stepmother’s attitude toward the two twins — but they 
argue that such dillerences w'crc a result rather than a cause of the 
dissimilarities in the behavior of the twins themselves. Other environ- 
mental discrepancies, such as a pronounced dillerence in the person- 
alities of the teachers which the two twins had from the fiist to the 
loijrth grade, did not, 'according to the authors, create any permanent 
psychological dillerence between the twins. It is their hypothesis 
that dilTcrences in “physiological tempo” and “ontogenetic timing,” 
rc^'iilting irom slight initial differences in the “mechanisms of sym- 
metric regulation,” must liave imposed at least subtle differences in 
the genetic constitution of the two twins ( 19, pp. 105-1 16). 

A number of points should be noted in evaluating this hypothesis. 
First, it is obvious that the authors do not mean “hereditary” when 
they use such terms as “innate” and “genetic,” since the heredity of 
the twins was identical. They arc apparently referring to prenatal 
environmental inllucnces which affected the relative rate of structural 
development of the two subjects Their comparison, then, is between 
such prenatal en\ironmcnial differences and postnatal diifcrcnces in 
training and other aspects of the psychological environment. Secondly, 
the evidence cited is equally consisterL with the hypothesis that such 
postnatal differences account for the behavior discrepancies. Since 
differences in the postnatal environment were admittedly present, the 
question of what was cause and what effect is debatable. Finally, if 
some unknown and undeterminable prenatal factors made the twins 
unlike in structural development, as the authors suggest, then their 
study offers no special advantage over die study of ordinary siblings 
in the analysis of the origin of behavior differences. If both learning 
and structural factors varied, we obviously cannot attribute the 
difference in behavior unambiguously to one or the other influ- 
ence. 

Experiments on intensive training in infancy have also been con- 
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ducted by McGraw (30). A pair of male twins ® were observed from 
birth to the age of 22 months. Jimmy, the twin who appeared stronger 
and better developed at birth, served as the control, his activity be- 
ing approximately that of a normal infant during the earlier period, 
and possibly a little more restricted than normal later. The other 
twin, Johnny, was put through intensive daily training from the age 
of 20 days. Both twins lived at home but were in the laboratory 
between 9 and 5 o’clock for five days a week. The performance of 
the trained twin in each task was compared throughout the period 
of the experiment with that of the untrained control twin. 

Specific exercise was found to have little or no effect upon a group 
of activities including simple reflexes, such as suspension-grasping, 
as well as crawling and creeping, erect walking, sitting, prehension, 
and other sensori-motor functions. Marked improvement resulted, 
however, from practice on a group of somewhat more complex func- 
tions such as skating, jumping, swimming, diving, ascending and 
descending inclines, getting off stools, and manipulating and climb- 
ing stools and boxes to reach an objective. Although a certain amount 
of sensory and muscular development obviously helps in^the latter 
functions, their performance seems to depend largely on specific 
training. The independence of the former group of functions from 
practice confirms many of Geseirs findings. For the execution of 
these simpler functions, the pi^esence of structures of a certain degree 
of development secjns to be sufficient or nearly sufficient. 

In recent years, a few reports have appeared whicli refer to ex- 
tensive research projects being conducted by the method of co-twin 
control at the Psychological Laboratories of the Moscow Medico- 
Biological Institute (27, 28), In one investigation, for example, 5 
pairs of identical twins between 5V2 and 6 years of age received 
IVi months of intensive training in block building by two different 
methods, designated as the ‘‘method of elements'’ and the “method 
of models,” respectively. In the former method, the subjects were 
allowed to sec the individual blocks making up the figure which they 
were to copy. In the latter, the sample block figure was covered with 
paper, thus making the individual blocks indistinguishable. One mem- 
ber of each pair of identical twins was taught by the former, the 

Oiii'inally bclie\cd to be identical, although subsequent physical development 
made this designation unlikely. 
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other by the latter method. Those taught by the ‘'moder’ method did 
more poorly on the test itself, but excelled on other block-building 
tests as well as on a number of other visuahpcrccptual and spatial 
tests. These differences persisted in tests administered 10 months 
after the cessation of training. Such results suggest the possible role 
of work methods in the development of individual differences in per- 
formance, a factor whose importance has been pointed out by other 
writers (cf., e.g., 1, 33). 

FXPrRlMFNTM RrSTRlCTION OF TRAINING 

IN 111! MAN INI ANTS 

With certain limitations, restriction of training — the principal tech- 
nique employed in th^‘ animal studies reported in the first section of 
the present chapter — has been applied in a few studies with human 
infants. 'Ihis procedure is illustrated in an experiment by Dennis (12). 
Tv/o female infants, who happened to be fraternal twins although 
this w'js irrelevant to the present experiment, were reared under con- 
trolled conditions in the expcrimenter\s home from one ti) 14 months 
of age. Dining the fust 7 months of this period, stimulation and 
activity were rigidly restricted, Oppoitunities lor standing and sitting 
were eliminated, and opportunities to grasp objects were highly mini- 
mi/ed. 'flic nursery was bare of all but cssciUial furnishings. Ihc 
cxperiimmters had no social contviet with the children except for 
physical care and for a fev^ k'st made during this period. 'I hey did 
not smile, fiown, speak to. or play with tiie suh|ects. 1 he tw'o infants 
were separated from each (Hher by n^'^ans of an opacjuc screen. 

dhe suliseqiient behavior deve]t)pment of these 'wo children was 
compared with norms e^t ibiished on infants brought up under normal, 
unrestricted conditions. The age at wlucli each of the Two experimental 
children lust pciforrned each of a number of specific activities was 
charted in relerencc to the average and rang o{ ages at wliich such 
activities appeared in the 'hiorm.d” groups. Functions which n('>rmally 
appear during the first seven mouths s K*v\cd no appreciable retarda- 
tion in the experimental sub;ccTs, Among these were such simple 
functions as fixating obicvts: starting, turning the head, or crying at 
a sound: giaspmg objects; watching cr pla>ing with own hands; and 
bringing hand or object to mouth. 1 vidcntly functions such as these 
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appear, regardless of exercise, when the necessary structural develop- 
ment has occurred. Responses which normally occur beyond the sev- 
enth month did, on the whole, show significant retardation, the age 
at which they appeared in the experimental subjects often falling 
beyond the range of the comparison subjects. The investigator reports 
that these responses were quickly established when the opportunity 
for practice was piovidcd, and concludes that their absence had 
resulted from lack of self-directed practice, rather than from lack of 
instruction or of socially administered rewards. 

CULTURAl DIFhFRINCl S IN INFANT-REARING 

PRACllCn S 

A sort of natural “experiment’' similar to that described in the pre- 
ceding section is provided by inlant-cradling practices prevalent 
among certain cultures. It has been the custom in Albania, for ex- 
ample, as in a number of its neighboring countries, to bandage the 
children tightly to their cradles during the first year of life, so that 
they cannot move their arms or legs. The cradle is kept in a darkened 
room, and the child has no contact with toys or other objects. The 
infant is unswathed and bathed once a day, and sometimes Jess often. 

Tests administered to 10 su^h inlants betw^een the ages of 4 months 
and one year (cf. 11) showed considerable behavorial retardation, 
hew reacted spontaneously wheil given the opportunity to do so. 
Coordination was decidcdl) poor, and grasping moxements occurred 
much later than normal. Only one out of the 10 infants was able to 
crawl before the age of one year, although all could sit up without 
support. Social reactions, on the other h<ind, were found to be ad- 
vanced, a finding which was attributed to the presence of large fami- 
lies and to the fact that persons were the only familiar type of stimu- 
lation in the child's environment. Children over one year of age were 
reported to be normal in social reactions, learning abilitv, and “mental 
productivity,” but retarded in coordination and expression. Their 
attitude toward new objects is described as interested and willing, 
though shy and clumsy, the children frequently depending upon adults 
for help in novel situations. 

Restrictive infant-rearing practices are also to be found among 
certain American Indian tribes, such as the Navajo and the Hopi. 
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Among the Hopi, the newborn child is bundled tightly in a blanket 
and tied securely to a stiff board. In such a position, the infant cannot 
move his arms or legs, or even turn his body. For the first three 
months he is kept in these wrappings, except for about one hour each 
day, when he is cleaned and bathed. Dennis (13) reports that despite 
this extreme restriction of movement, when Hopi children are re- 
leased fficy show the same sitting, creeping, and walking behasior — 
and in the same sequence — as white American children. During the 
short daily periods when they arc freed of their wrappings, more- 
over, they assume the usual flexed position, reach for objects and 
carry them to the mouth, reach for their toes and put them into the 
mouth, and exhibit other characteristic motor behavior of an un- 
restricted infant. It is also interesting to note that no significant dif- 
ference was found in this study between the average age of walking 
of Hopi infants cradled in the traditional manner and other Hopi 
children who had been ciadled in the manner of white American 
children. In a group of 63 children reared in the Hopi manner, the 
average age of walking was 14 05 months; for 42 Hopi children 
reared without binding, the average age of walking wa, IS 05. This 
difterence is not statistically significant. 

Ihe results of these two slucfies arc rot in complete agreement, 
the former suggeuing somewhat greater disruption of functions 
through lack of exercise than the latter. The specific results probably 
depend upon a number of factors, such as the nature and degree of 
the restriction and the age at which it is discontinued. The particular 
fmu lions obser\ed also undoubtedlv dilier in their dependence upon 
practice. Additional comparathc data arc obviously needed before 
we can draw^ any definite conclusions. In gcneial, how'cver, such 
studies suggest that ceitain simple, reflex lunctions of early infancy 
depend almost wholly upon structural development, while many 
others require a relatively biicf period of exercise for their successful 
performance." 

cases ba\p hccii rcpoiTcd of American cbil lit‘n who, because of wilful 
nedect on the put of parents had been reared loi st\cial >< ais undei much more 
extreme conditions of rcstiictcd utivity. The .ntcipietaiioii of such ( ises is iliflicult, 
hc’iwever, because of low nuMit Jitv of parents and bcc iiise of Ihc vcr> poor physical 
conihtion of the children rcsultrng tiorn nmlccl (tl 'is, pp 249 ?S()) A thud case 
has been descrtbcd of an infant, neektlcd mcdicaJiy and psychologically, who had a 
Kuhimann-Binet IQ of 29 at the age of 19 months. The IQ lose steadily following 
institutional caie and training, and seemed to have reached a stable le\el at 97 by 
the age of 6 (29). 



182 Differential Psychology 

CASE REPORTS OF “FERAL MAN” 

A much more extreme type of “natural experiment” is furnished by 
the cases of children found to have been living in isolation or in ex- 
clusive contact with lower animals. Such “wild children” have been 
described since early historic times. In 1758, Linnaeus included them 
in his classification of the human species, under the designation of 
“feral man.” An extensive survey of recorded cases has recently been 
prepared by Zingg (41, 42). Over forty cases arc described, although 
in a number of them the available information is quite meager or the 
isolation was only partial. 1 hese wild children include a few who had 
apparently been abandoned or had wandered off and survived in the 
wild largely through their own efiorts, as well as a number who seem 
to have been reared by such animals as the wolf, bear, goat, pig, 
sheep, cattle, and leopard. Children who have been confined in isola- 
tion from human contacts and have been living under conditions 
barely sufficient for survival are also included in this category . 

These cases of wild children have been of special interest to psy- 
chologists because of the possible light they ma) throw upon the 
question of how far normal human behavior develops in the aj^sence 
of normal human stimulation. In his summary of the recorded cases, 
Zingg (41 ) concludes that such wild children were, without exception, 
mute and quadrupedal. No vocalization lesembling human speech 
developed under these circumstances, and the characteristically hu- 
man, erect locomotion was not found. All had developed some form 
of locomotion on hands and feet or on hands and knees, and their 
physical structure had often become modified (by the appearance of 
calloused pads, etc.) to permit rapid and efficient quadrupedal 
locomotion. 

Characteristic sensory modifications arc also reported, the senses 
of smell, hearing, and sight — especially night vision — often showing 
an animal-like keenness. I ating habits arc markedly unlike those typi- 
cal of the human. Raw meat is the common diet among children 
reared by carnivorous animai^; wild-living children arc described as 
subsisting largely on bark, roots, grass, herbs, and leaves. One “wild 
girl” in France had become very adept at swimming for fish and 
frogs, which constituted her principal food. The pattern of eating 
behavior is also similar to that of lower animals, including the smell- 
ing of food before eating, lowering the mouth to the food, sharpening 
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teeth on bones, and the like. There is no evidence of any tendency to 
cover the body or to devise “clothing” of any sort. Such children seem 
to have been relatively insensitive to heat and cold and to have de- 
veloped no “sense of shame” from nakedness. No crying, tears, or 
laughter was observed, although other expressions of violent anger 
or impatience are reported. Expressions ol sex interest and activity 
were either completely absent or present only in the form of diffuse, 
general, undirected activity. No “consciousness of kind” or gregari- 
ousness was evidenced, the children shunning humans and often 
showing preference for tlie company of lower animals. 

These reports of wild children have been seriously questioned by 
some psychologists, such as Dennis (14). It is undoubtedly true, as 
the records themscKes show, that in several of the cases summarized 
by Zingg the association with animals either began after the child 
had reached an advanced age in human contact, or such association 
was only partial, the child still remaining in some contact with human 
adults. It is also true that the data on some of the cases, especially 
the earlier ones, are so meager and so subject to the inaccuracies 
and bias of the original observers as to be of doubtful authenticity. 

Some wTiters (ef., e.g,, 14) would go further, however, and pro- 
pose an alternative explanation for all eases of w^ild children. They 
maintain that such children may have been feebleminded to begin 
with, which would account for their abandonment in certain cultures. 
1’heir lack of typical human behavior, such as language and erect 
locomotion, as well as their other “animal-like” characteristics, arc 
then attributed to their oririnal mental defect. The usual counter- 
argument is to ask: “How, then, could such a feebleminded child 
have managed to survive in an environment which would tax the in- 
genuity of even a normal adult?” In answer to this, Dennis ( I4) has 
proposed the possibility that the wild children may actually have been 
abandoned only a short time — perhaps only a few days — ^bef('»re they 
were found, and that their behavior deficiency was incorrectly inter- 
preted as a sign of prolonged isolation. 

In a reply to Dennis’ critique, Zingg (42 ) calls attention to the 
structural changes following prolonged four-footed locomotion, as 
well as to the degree of proficiency attained in such locomotion, con- 
ditions unlikely to have developed if the child had been living in 
human society until a short time prior to his discovery. The marked 
progress in learning human ways made by a number of such children 
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is also mentioned as evidence against the hypothesis of initial feeble- 
mindedness. To be sure, one rarely finds the performance of such 
subjects after training to equal or even approximate that of normal 
children of the same age. This is hardly to be expected, however, not 
only because of the long period of isolation during which opportuni- 
ties for acquiring normal human behavior were lacking, but also 
because of the interference or “negative transfer’' from other modes 
of behavior which have been acquired in the wild and which must be 
“unlearned” before progress can be made. By way of contrast, Zingg 
calls attention to one case of a “wolf-boy” oi India who was appar- 
ently a “true idiot.” This boy showed virtually no progress subsequent 
to his capture, although he lived well into adulthood. The compara- 
tively greater progress made by the other wild children suggests that 
they may have been structurally normal and rendered deficient only 
by their early stimulational deprivation. 

Zingg also cites the reports of reliable eye-witnesses indicating 
that at least two wild children (the wolf-children of Midnaporc, to be 
reported below) had actually been living with wolves for some time 
prior to their capture. Iliis is emphasized by Zingg in reply to Dennis' 
argument that no dirext evidence is available to show that human 
children have in fact been reared by animals. Dennis suggests that 
when children are captured in the company of animals, they may 
simply have been accidentally brought together by their common 
cflorts to hide from a pursuer. Dennis stresses the importance of this 
point for the interpretation of the behavior of wild children. He points 
out that if a cluld is abandoned before the age of about three, he 
cannot possibly survive in the wild unless “adopted” and cared ior by 
an animal. On the other hand, if the child was over three at the be- 
ginning of his vild existence, then he would already have acquired at 
least the rudiments of human speech, locomotion, and similar func- 
tions, unless he were cimgenitally defective. It thus becomes of crucial 
significance to establish the possibility that, at least during their 
earliest, helpless years, such children could have been nurtured by 
animals. 

The objections raised by Dennis (14) and others must be carefully 
weighed in interpreting any report on allegedly wild children. Such 
objections may wxdl hold for a considerable number of the cases cited 
by Zingg. On the other hand, the available evidence strongly suggests 
that at least three or four cases are well-authenticated, genuine in- 
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stances of prolonged isolation from human contact. One of the most 
intensively studied cases is that of Victor, the Wild Boy of Aveyron 
(23). In September, 1799, three sportsmen came upon a boy of 11 
or 12 in a French forest. The boy was completely naked, unkempt, 
scarred, unable to talk, and seemed to have been leading a wild, 
animal-like existence. He was seized by the men as he vvas climbing 
a tree to escape their pursuit, and was subse(|uenlly brought to civili- 
zation, where he finally came under the guidance and observation of 
the French physician hard. The very illuminating account which 
hard published on his own findings has immortalized the Wild Boy 
of Aveyron. 

When found, the boy seems to have been deficient in all forms of 
b havioral development, including sensory, motor, intellectual, and 
emotional. This is clearly brought out in the following description 
given by hard (23, pp. 5-8): 

His eyes were unsteady, cvprcssionless, wandering vaguely from one 
object to another without resting on anybod> : they were so little experi- 
enced in other wavs and so little trained by the sense ot touch, that they 
never distinguished an object in lelief Irom one in a pictuie. His organ of 
hearing was equally insensible to the loudest noises and to the most touch- 
ing music. His voice was reduced to a suite ol complete muteness and only 
a uniform guttural sound escaped him. His sense of smell was so uncul- 
tivated that he was equally indifiercnt to the odor of perlumcs and to the 
fetid exhalation of the dirt with which his bed was filled. Finally, the organ 
of touch was restricted to the mcebanicai function of grasping objects. 
Proceeding then to the state of the intellectual tunctions of this child, the 
author ol the report presented him to us as being quite incapable of 
attention (except for the objects of his needs) and consequently of all 
those operations of the mind which attention involves. He was destitute 
of memory, of judgment, of aptitude lor imitation, and was so limited in 
his ideas, even those relative to his immediate needs, that he had never yet 
succeeded in opening a door or climbing upon a chair to get the food that 
had been raised out of reach of his hand. In short, he was destitute of all 
means of communication and attached neither expression nor intention to 
his gestures or to the movements of his body. He passed rapidly and with- 
out any apparent motive from apathetic melancholy to the most immod- 
erate peals of laughter. . . . His locomotion was extraordinary, literally 
heavy after he wore shoes, but always remarkable because of his difficulty 
in adjusting himself to our sober and measured gait, and because of his 
constant tendency to trot and to gallop. He had an obstinate habit of 
smelling at anything that was given to him, even the things which we con- 
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sider void of smell; his mastication was equally astonishing, executed as 
it was solely by the sudden action of the incisors, which because of its 
similarity to that of certain rodents, was a sufficient indication that our 
savage, like these animals, most commonly Jived on vegetable products. 

It is interesting to note that the sensory deficiency of this boy seems 
to have been quite specific and in many instances directly traceable to 
his mode of life. Thus Itard observed that “the sound of a cracking 
walnut or other favorite eatable never failed to make him turn 
around . . . nevertheless, this same organ showed itself insensible to 
the loudest noises and the explosion of firearni>” (23, p. 15). Sexual 
development showed the same general undifferentiated type of re- 
sponse observed in the case of animals reared in isolation. Following 
the onset of puberty, periods of vague restlessness and discomfoit as 
well as occasional fits of sadness or anger were noted, without, how- 
ever, the development of specific, normal sexual activity. 

After five years of ingenious, painstaking, and methodical training, 
Itard abandoned the task because he had failed to bring the boy up 
to a normal level of performance. It is significant, however, to note 
the degree of improvement which was cllectcd during this period. 
Itard himself writes: “But if one limits oneself to the two terms of 
comparison offered by the past and present states of young Victor, 
one is astonished at the immense space which separates them; and one 
can question whether Victor is not more unlike the Wild Boy of 
Aveyron arriving at Paris, than he is unlike other individuals of his 
age and species'’ (23, p. 53). Besides learning many routine activi- 
ties of a civilized community, including eating habits, dressing, per- 
sonal care, and the proper use of common articles of furniture, Victor 
showed considerable progress in the identification and discrimination 
of objects, the formation of simple abstract concepts, and other in- 
tellectual tasks set by his tutor. Although unable to articulate sounds, 
he succeeded in learning to communicate through written language, 
being able through this medium “to express his w^ants, to solicit the 
means to satisfy them and to grasp by the same method of expression 
the needs or the will of others” (23, p. 84). Evidence of considerable 
development in emotional responses and in social and “moral” atti- 
tudes is also cited by Itard. 

The more recently discovered “wolf children” of Midnapore (18, 
24, 36, 37, 41) represent another well-authenticated case. In 1921 
tw'O girls, one approximately two to four years of age and the other 
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eight or nine, were found living in a cave with w^olves in a sparsely 
settled region of India. They were taken into a local orphanage, 
where attempts were made to train them. A detailed diary of the girls’ 
activities, kept by the rector of the orphanage, is now available in 
published form (35), together with analyses and comments by sev- 
eral psychologists, a sociologist, a geneticist, and an anthropologist. 
It proved dilhcult to keep the girls in good health, particularly be- 
cause the readjustment to a normal human diet led to physical 
debility and severe skin reactions. The younger girl, “Amala,” died 
within a year; the elder, “Kamala,” In'ed for about eight years after 
her discovery. Most o\' the observations on record arc thus necessarily 
based on the elder child. 

Kamala, like her younger companion, showed a strong preference 
for raw meat, and was lond of pouncing upon any tieshly killed 
animal which she lound. Displa}ing a keen, animal-likc sense of 
smeik she was able tc< delect the odor of meat Irom a great distance. 
Hearing was also very acute. Her c}es are described as having a 
peculiar glare, like llie eyes of dogs or cats in the dark. It seemed that 
Kamala could see better at night than in the daytime, and she seldom 
slept alter midnight. It has been suggested (18) that tiie vitamin in- 
tal e in her diet may ha\e favored chemical changes in the e>e wdiich 
im[)roved vision in dim light. I ating tind drinking were accomplished 
by low^iiiig tlic mouth into the plate. In gencial, hei mouth, rather 
than hei hands, served as a piehensilc organ, bventually she was 
taught to use her hands in eating. 

As in the case of othci feral children, locomotion was quadrupedal. 
Kamala walked on hands viiid knees lor slow locomotion, and on 
hands and feel for running. She was able to run so last by this method 
tliat it proved dilhcult to overt.ike he. I hick caHo'^itics hvid developed 
at the knees, elbows, soles, and palm , undoubtedly as a result of such 
locomotion. It was not until siv years later that she finally adopted 
erect walking, altiiough cv.n at that time she would revert to the 
iormei loui-k)oted iLchiuqac when running. Her only \ocali/ation 
at the time of diseoveiy consisted ol a ct\ (>r hovv] which bore a ccr- 
lain resemblanee to the t>pieal vvo‘i\r». With piolonged training, she 
was finally able to say about t(^»-|y-live words and to torm simple 
sentences of two or three w'cvds Ccicti. 

Mention should also be made of the celebrated and mysterious 
case ol Ka^par HauscT (cl 3.^, pp. 277-36,^), abc'ut wliom so much 
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has been written. Some accounts suggest that this boy was an heir 
to a princely house and was put out of the way by political enemies. 
He was apparently confined from early childhood in a dark cell, not 
large enough for him to stand upright. No clothing or cover was fur- 
nished except a shirt and trousers. When he awoke, he was accus- 
tomed to find bread and water, but he never saw the person who 
brought them and he had no knowledge of the existence of other 
living creatures besides himself. He was released in 1828, when about 
17 years of age. At this time he was first discovered wandering aim- 
lessly about the streets of Nuremberg. He could not talk, but repeat- 
edly uttered certain phrases meaninglessly. He is reputed to have had 
a remarkable sense of smell and a surprising ability to see in the 
dark. His walking resembled the first efforts of a child. After various 
vicissitudes, his instruction was undertaken by a skillful and pains- 
taking teacher. Under the latter’s tutelage, Kaspar Hauser made rapid 
progress and soon learned to speak. By this means he was able to 
communicate what he recalled of his life in the cell as well as his 
experiences during his period of instruction. Unlike other cases of 
children brought into contact with civilization relatively late in life, 
Kaspar Hauser profited sufficiently from his education to reach and 
even possibly surpass normal achievement. 

To psychologists, probably one of the most interesting observations 
made in the coutse of Kaspar Hauser’s training is one pertaining to 
the development of space perception."' vShortl} after his release, Kaspar 
Hauser was asked by his tutor to look through a window which gave 
a view of a pleasant landscape. The boy is reported to have withdrawn 
in horror, saying, “Ugl}, ugiy’” At a latei date, when questioned about 
this incident, Kaspar Hauser explained it as follows: 

Yes, indeed, what T then saw was very ugly For when I looked at the 
window It alwa>s appealed to me as if a window-shuttei had been placed 
close before my c>cs, upon which a wall-paintcr had splattered the con- 
tents of his different brushes, filled with white, blue, green, yellow, and icd 
paint, all mingled together Single things, as I now see things, I could not 
at that time recognize and distinguish from each other. T his was shocking 
to look at; and besides, it made me feel anxious and uneasy; because it 

®This incident is reported in the contemporary account wntten by Paul J 
Anselm, Ritter von leuetbaLh, Kispn llausei s piotectoi and friend. This account, 
which is considered to be the most authentic source of mfoiination on the case, was 
translated into English m 18'!'^ by H G I inberg and is icpnnted in full n Zinggs 
book (1^ pp 277 365). 
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appeared to me as if my window had been closed up with this parti- 
coloured shutter, in order to prevent me from looking out into the open 
air. That what I then saw were fields, hills, and houses; that many things 
which at that time appeared to me much larger, were, in fact, much 
smaller, while many other things that appeared smaller were, in reality, 
larger than other things, is a fact of which I was afterwards convinced by 
the experience gained during my walks; at length 1 no longer saw anything 
more of the shutter (35, p. 323). 

In a further quotation from the same account, we arc told: 

To other questions, he replied, that in the beginning he could not dis- 
tinguish between what was really round or triangular, and what was only 
painted as round or triangular. The men and hoises rcpicseiited on sheets 
ot pictures, appeared to him piccisely as the men and horses that were 
carved in wood; the first as round as the kilter, or these as flat as those. 
But he said, that, in the packing and unpacking ot his things, he had soon 
felt a diflerence; and that afterwards, it had seldom happened to him to 
mistake the one for the other (35, p. 32'^). 

These examples of '‘wild children*’ illustrate the close dependence 
of human development upon the environment in vthich the subject 
is reared and the t>pe of stimulation to which he is exposed. If a 
child is deprived of nr^rmal human contacts, his behavior will come 
in resemble in many ways that of a low-grade idiot. Such a condition 
has, in fact, been regarded as a sort of environmental feebleminded- 
ness and has been given the name of isnUuion amentia (cf. 39, pp. 
202 297). When a child is brought up in contact with animals, strik- 
ing similarity to the behavi. ot those animals is exhibited, and such 
behavior proves difticult to eradicate once it has beu)me firmly estab- 
lished. Subsequent educational effn is arc inadetiualc to undo the 
cllccts of early nurture. Rousscau*s d’cam of the "iinble savage” w'hosc 
inner nature is allowed to develop, free and unhampered by human 
interference, proves to be a vain chimera. The situation has been 
aptly summarized b} Stratum (37, p. 597): 

I ack of association with adults duiing a certain ciitical period of early 
childhood, it seems likely, piodiiecs in s me or all normal childicn marks 
like those of congenital delect. The evidence seems against the lomantic 
view that a civih/ed community is a chief obstacle to the development of 
personality. On the cv)ntrary, the higher forms of personality become pos- 
sible only in and through such a community. By our biological endowment 
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An understanding or the dlgri e to which scores on a psycholog- 
ical test can be raised by practice or by coaching is obviously essential 
for the proper use and interpretation of such a test. Apart from this 
purely technical consideration, however, the investigation of such 
practice effects is relevant to the role of stinuilational dilfercnces in 
behavior development. This type of investigation has been treated by 
some psychologists as another approach to the pervasive question of 
heredity and environment.^ 

Theoretically, the interpretation of the elleets of practice or coach- 
ing upon mental test performance presents several possibilities. Thus, 
some have aigued that if psychcdogical test performance should prove 
relatively impervious to improvement through practice or coaching, 
then the scores on these tests ma> be regarded as indices of “native 
ability” or '‘potentiality.” If on the other hand, the tests do prove 
susceptible to such influences, then it might be argued cither that 
these particular tests arc unsuitable as measiiics of “native capacity,” 
or that the concept of “native capacity” should he redefined or dis- 
carded. However inconsistent some of these interpretations may be 
with the nature of heredity and envin iment, as discussed in Chapter 3, 
it is necessary to keep (hem clearly before us, since ali such inter- 
pretations have been expressed in the highly controversial literature 
of this field. Failuie to recognize the shades of difference among these 
views has added to the confusion in discussions of heredity and 
env ironment 

A further question which has htv. ' raised regarding the role of 
practice or coaching relates to the petnianence of the effects. As con- 

^ Cf. the seventh method h^red in Chaptci 4. 
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ccived by some writers, this question pertains to whether the “under- 
lying course of development'’ may be altered by environmental condi- 
tions or whether such conditions exert only a “superficial” and “transi- 
tory” effect Still another way in which the results of such expenme nts 
have been analyzed is m terms of the effect of practice upon the extent 
of indiMcliial differences. Do individuals become more alike, or do 
they become more unlike each other v\hcn subjected to a unilorm 
period ol practice"^ Ihe implications ol the answers to this question 
have also been disputed at length, and will be considered in the clos- 
ing section of the present chapter 

mr TirrcTsoi practk r upon pi rf oraianci lfvfl 

All the investigations to be considered iii the present section were con- 
ducted on children of school age Crates (7) studied the effect of con- 
tinued practice upon nnuion span fen dn^its Two groups ol school 
children selected so as to be equivalent in age, number of bo)s and 
girls, Slanford-Binct 10, school grade, teachers' estimates ol scholastic 
malunt\, and scoies on several memory tesis, weie giyen an initial 
test in digit span The children in the Practice group weie then put 
through mdiyidinl practice in lecalhng dicils on each ol 1£ days 
extending over a period ol five months At the end of this period, both 
Practice and Contiol groiip*- were given a fin il test 1 he average scores 
on initial and final tests arc reproduced in I able 4 

T \B1 [ 4 ! hi I fftc^s i f Puiifuc upon Me wots Span for Dibits 
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Both giOLips show improvement, but the Practice group is clearly 
ahead, manilesting a gam which normally requires a six->ear peiiod, 
according to the Stanlord-Binct norms for this function Four and cne- 
hali months after the final test, both groups were again tested, by a 
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different examiner. This time the Practice and Control groups were 
approximately equal. Finally, the two groups were subjected to 22 days 
of practice, at the end of which boih showed improvement and in 
approximately equal degree. It was also found that the training in 
digit span had no effect on performance in other types of mental tests. 
From these findings, Gates concludes that training is highly specific, 
consisting in the acquisition of special skills and techniques, and that it 
docs not alter the growth of the “underlying mental functions.'” 

It is unfortunate that, in most studies on experimentally administered 
practice, all effects not directly resulting Iroin such practice arc attrib- 
uted indiscriminately to growth or maturational phenomena, the influ- 
ence of the v;ist amount of other training which the child is receiving 
in the course of everyday life being disregarded. 1 has in Gates’ experi- 
ment, the improvement of the Control group could just as well be 
attributed to intervening experiences as to growdh. The data them- 
selves, to be sure, do not permit a choice between the two explana- 
tions. That experience may have been the important factor is, however, 
suggested by the fact that both groups drop tt^ an equal level when 
retested later by another examiner. Jhc drop may have resulted from 
the time of year at which the tests were administered, or from other 
factors incidental to the school situation. The rlc)scness with which the 
child attends to the material and the effort he puts forth to concen- 
trate on the task of memorizing are very important factors in deter- 
mining his span; and it seems entirely plausible that such factors 
should be influenced both by the attitude of the particular examiners 
and by the sum total of school experiences which the child has had. 
It is noteworthy that the 4*^-month period preceding the drop in score 
included the summer vacation, which is definitely an environmental 
and not a maturational incident. 

The marked susceptiiiility of a function like memory span to train- 
ing, which this experiment demonstrated, seems in itself to minimize 
maturational factors. To a.vsume the existence of some underlying 
hypothetical capacity of memorizing which remains unaltered while 
performance on a memory span test rises and falls seems totally un- 
warranted by the facts and certainly cv>es not clarify the problem. 

The effect of repetition upon intelligence test scores has also been 
investigated. Rises in score have been regularly reported when the 
identical test is repeated within periods ranging from a few days to a 
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year. In a survey of 614 school children, for example, Adkins (1) 
found that children taking three common group tests for the second or 
third time at intervals of a year obtained higher scores than children 
of corresponding school grade who were taking the tests for the first 
time. Similarly, in the Harvard Growth Study (6), the median IQ in 
successive years rose from 102 to 113 when the same group intelli- 
gence test was repeated,- but dropped again to 104 when another 
group test was substituted. In the report of this study, it is pointed 
out that the meaning of an IQ on repeated tests may change consider- 
ably. Thus, for example, an IQ of 100 may correspond to the 47th 
percentile on the first testing and to the 17th percentile on a subsequent 
testing (6, p. 1 34). In other words, an IQ which, if obtained on a first 
test, would indicate approximately average ability, on a retest might 
signify ability in the lowest quarter of the distribution. In an analysis 
of the Stanford-Binct IQ’s of children in the same study (cf. 5), re- 
tests administered within six months showed an average gain of four 
to five IQ points. 

Gains in score have also been fouml upon the administration of 
parallel forms of the same test, although such gains tend in general to 
be smaller. Fcrman and Merrill (21) report an a\eragc increase of 
approximately 2.5 IQ points when Form L of the revised Stanford- 
Binet was followed by Form M, or vice \ersa, w illiin a few days. In the 
Minnesota Preschool vScalc, it is suggested that 3 IQ points be de- 
ducted as a correction for practice elfcct when alternate forms arc 
administered within a few weeks (cf. 8). 

E. L. dhorndike (23) gave alternate forms of a group intelligence 
test to several groups of high school, college, and giaduate students. 
The two forms were administered in immediate succes.don and were 
preceded by a lO-minutc fore-exercise on similar items. The forms 
were used in re\crse order in dificrent groups so as to cancel any 
existing differences in he difficulty of the parallel forms. The average 
gain in score on the second form was approximately 8 points for the 
various groups tested. In an earlier study in which the fore exercise 
had been omitted, the average gain had been slightly over 12 points. 
In a further investigation with the same lest, 15 equivalent forms w^erc 

-The authors icpoit thal the j^ains lessened in later Inals, because the brighter 
subjects weic leachiuR the uppci limits of perfoimance aitihcially set by the lesl 
ceiling Thus the gams would piesumably have been larger if tests with higher ceilings 
had been employed 
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administered, one each day, to 20 gifted and 19 normal children, all 
approximately 11 years old and attending the same school. The mean 
of the gifted group rose from 87.5 to 111. The largest gain occurred 
on the second trial, and the highest mean score, 1 15, was reached on 
the 10th trial. Subsequent trials showed only minor fluctuations. In the 
normal group, the mean rose from 51 to 86.5, the largest gain again 
occurring on the second trial. Subsequent trials showed smaller fluc- 
tuations, in either direction, with the highest mean score falling on the 
13th trial. 

Some evidence is available suggesting that a slight improvement also 
occurs upon successive retests with different intelVn^cnce tests, Rodger 
(18), for example, gave six group intelligence tests to 76 children, 
aged 1 1 and 1 2, with an interval of two weeks between each successive 
test. The childicn's average TQ rose from 101.9 on the first test to 
109,8 on the sixlh.''^ Although this practice eflccl was found at all 
ability levels, the brighter children, in general, showed larger gains. 
For example, the 95lh percentile rose from 123.5 to 138.5, whereas 
the 5th percentile rose only from 81.2 to 83.0. Another investigator 
(6), however, iound no such practice effect from one intelligence test 
to another, the improvement being specific to the particular test. Un- 
doubtedly the “spread” of the praetiee eficct will djllcr with such fac- 
tors as the degree of simikuity of the tests, and the age, education, 
and previous “test-wiseness” of the subjects. 

A qualitative anahsis of the changes occurring when tests are re- 
peated was undertaken by Greene ( 1(), 11). Groups of from 19 to 235 
college sophomores were given four trials of each of 14 tests at inter- 
vals of one day, not all tesu being employed with any one group. A 
wide range of fairly specific functions was covered by the tests, which 
included the Seashore musi*.al discr mination tests, tapping, aiming, 
pencil ma/es, digit span, feature coiiiparison, spaced of reading, equa- 
tion completion, vocabulary, Kohs Block Design, Stenquist Mechanical 
Assembly, and Minnesota Spatial Relations. An analysis of the test 
scores, supplemented by observations of performance and introspec- 
tive reports, led the author to conclude that the qualitative changes 
in procedure correspond closely to th^ differing amounts of improve- 
ment on the various tests. Those tests which showed little or no 

'’These scoies were adjusteH for possible difliTcnces in norms resulting fiom the 
fact that different tests hid been standardi7cd on diffeient populations 
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improvement with repetition depended upon processes which change 
little with practice; such tests were performed in like manner on initial 
and subsequent trials. Tests showing large practice effects, on the other 
hand, seemed to depend upon different processes or work methods 
when repeated, some processes being eliminated and new ones intro- 
duced. 

Tests involving primarily speed of ballistic movement (e.g , tap- 
ping) or limmal discrimination (eg, the Seashore tests) showed the 
smallest practice eflect, the impiovemcnt ranging from 0 to 5 per cent 
of the initial scores, lests depending upon precision of movement 
(c g , aiming) or upon specihc preliminary intoimation (e g , vocabu- 
laiy ) showed increases ol 6 to 25 per cent. At the other extreme, in- 
creases of 76 to 2()(j per cent were found in such tests as the Kohs 
Block Design and the pencil mazes, in which a generalized rule or 
principle could be learned during the performance oi the test. In- 
cicases of more than 300 per cent occuned in those tests in which a 
solution or partial solution could be recalled and applied directly in 
subsequent trials, <is in the Stenquist AssembK lests ‘ 

It IS apparent that, at least in certam t\pes ol tests, repetition ma> 
produce a change in the natuic ol the test Ihus a test which on its 
fust administiation measures arithmetic leasonini! or mechamctil apti- 
tude ma>, upon repetition, bee'ome prim.iril) a test ol memoij^ and 
speed On latei tiials the subiext need onl) recall and exeviile the tin- 
ished solutions which he worked out diuing the initial tnal It nnght 
be ciigiKcl that this is simply a t)roblerii of test adminisiration and has 
no bearing upon the development ol beha\ior Obviou^’]/ tlie noims. 
validit\ and relhibilit}/ determined loi the initial trial ol such a test 
would be inapplieahle to the scoies cdUained upon lepeated testing. 
Repetition ot such a test might be said to “spoil" U as a divignostie 
instrument \ecoid ng to this \kw it is only the me<isuimg indui- 
ment which is allected and not the behavu^r which it is designed to 
measure, 

Jn evaluating >uch an inteiprc tation, it should be noted, first, tliat 
an> diiholonw between ‘test behjMor' and ‘Tinder]\ing behavior 
functions” is ini^’eading and inconsi^ent with the logic of test con- 
struction. I very psjcholomeal test necessarily samples bchaMor func- 

should be noted of eoMist ih U these dlTdtnces in pci ecnl mn n»iv 'tsiilt 
in put lieun djlhienecs in the loe. lion ul the dbili iiv /cios of ihe \ai o s t^sts 
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tions. Any influence which affects lest performance, therefore, may 
also affect performance outside of the test situation. If repetition of a 
test leads to marked improvement, then repetition of similar activities 
in everyday life will probably lead to marked improvement in the per- 
formance of such activities. 

Secondly, if repetition of a test alters the nature of the behavior 
being sampled, because different work mclhods are employed in tak- 
ing the test before and after practice, then the role of work methods 
ought also to be considered in comparing the initial performance of 
different individuals. For example, individuals whose previous experi- 
ence includes the solution of many arithmetic problems dealing with 
amount of money spent and saved out of weekly earnings, or number 
of pencils which can be bought for a given amount of money, will rely 
more heavily on memory and routine solutions, and less heavily upon 
reasoning, in taking a test which consists of such problems. The reverse 
will be true of individuals without such pre\ious experience. This is 
even more apparent in such tests as mechanical assembly. On a test of 
this sort, the initial performance of a person who has frequently taken 
apart and put together bells, clocks, latches, and other mechanical 
gadgets may be more nearly comparable to the third trial performance 
of a mechanically iiicxpeiicnced individual than to the latter’s first- 
trial performance. 

In other words, when a test proves to be markedly susceptible to 
practice effect, the behavior which it measures is probably susceptible 
to practice effect in everyday life, to a coi responding degree. Indi- 
vidual differences in such b^'hav^or may therefore result largely from 
such differences in previous experience. This docs not mean, of course, 
that repetition of the test — or any other factor which raises test per- 
formance — w'ill in itsell improve the behavior area which is being 
sampled. Ihus it would be absurd to expect that the repetition of a 
particular mechanical aptitude test, which raises the subject’s score 
from the 40th to the 70th percentile, has increased his general mechan- 
ical aptitude by that amount. The subject’s mechanical aptitude has 
been raised only for the small sample of tasks included in the test and 
any similar tasks to which he can ap^’iy the specific procedures or 
principles he has thus learned. But such a rise in score docs suggest 
that similar practice in daily life might raise performance in the 
broader area which the test is sampling. 
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THE INFLUENCE OF COACHING 

A number of investigations have been conducted to determine the 
effect of specific coaching upon intelligence test scores. The 1916 revi- 
sion of the Stanford-Binet has probably been more thoroughly ex- 
plored in this legard than any other test. In three studies (4) carried 
out under the direction of Terman at Stanford Um\ersity, children 
were given instruction and practice for several weeks on material 
either identical or similar to some of the tests in the Stanford-Binet 
scale. The groups were small, varying from 10 to 26, but in each study 
the trained group was carefully matched with a contiol group by 
“pairing'’ the subjects. All experiments cle«irly demonstrated the pos- 
sibility of teaching a child to perform tests which he was formerly 
incapable oi doing because of age or mental level. The influence of this 
improvement upon the 10 obtained on the whole scale dillcreJ in the 
three studies, being most evident, as would be expected, in that study 
in which the trained functions overlapped with the largest number of 
Stanford-Binet tests. In this study, furthermore, retests after a ^i\- 
week period, during which neither group had received any training, 
showed the practice group to have retained its advantage over the 
control group. 

A more extensive investigation on the effects of coaching is reported 
by Greene (9, 12). Three groups of children were given the Stanlord- 
Binet. 1 he subjects in one of these groups were then coached on the 
specific tests m which they had tailed. A second group was coached 
on material similar but not identical to that in the Stanford-Binet. No 
child in ehher ijroiip was coached for over t\^o hows altoi^etlur, al- 
though the training was distributed over a period of two weeks, fhe 
third group served as a control, receiving no special training in the 
test nLitenal. All groups were retested at intervals of three weeks, three 
months, one year, and three years alter the initial tests. The average 
lO's of each group on the initial test and on each of the four retests 
arc given in 1 able 5. The results obtained in two schools, A and Y, 
have been kept separate since a slightly different method of coaching 
was employed in each. A total of 153 second grade school children 
served as subjects during the first year of the study, but only 83 could 
be reached in the three-year retest. The data reproduced below are 
based only on the subjects tested over the entire three-year period. 

^ 1 e , the study bv Casey (4, pp 4'11 4'IS) 
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TABLE 5 The Effects of Coaching 
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II will he noted that whereas the control groups manifest only irreg- 
ular fluctuations from time to time, the coached groups in both schools 
show marked improvement on the second test, wJiich followed shortly 
after the coaching period. This improvement is retained on successive 
retests, although m constantly decreasing amount. Ihe gradual drop 
in IQ obseived in the ct>achcd groups may be attributable partly to 
forgetting of the coached material and partly to the fact that, as the 
children grew older, they were tested to an iiKreasing extent at higher 
age levels ifi which thry had not been coached. 

That the latter is probably the major factor is demonstrated by a 
comparison of the coached groups in the two schooK. In school ^ , the 
children were coached more intensively on fewer tests; in school Y, 
they were coached on two additional higher levels. Thus the effects of 
coaching in school Y should not be ‘‘outgrown” as readily as in A. 
The average IQ’s do in fact shov larger and more lasting effects of 
coaching in school Y. The groups trained on similar material also 
show an immediate improvement, which gradually disappears on suc- 
cessive retests. As would be expect, k the gains in these groups are 
much smaller throughout than in the groups which had been directly 
coached. 

All these studies indicate the great extent to which mental test per- 
formance may be influenced by tiaining. Such findings suggest vast 
possibilities regarding the p irt played by the incidental and often acci- 
dental training of everyday life. Tha’ d^e effects of a brief period of 
training arc not permanent seems to be quite beside the point. When 
training is discontinued, we should naturally expect the improvement 
to fall off because of forgetting If, furthermore, children are tested in 
different functions at successive ages, as they are to a large extent in 
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the Stanford-Binct, the effects of training will not be manifested over 
a Jong period. It is futile to expect that a brief period of highly specific 
instruction or practice should raise the “general mental level” of the 
child, especially since such a mental level is itself a manifold of widely 
diverse and loosely interrelated functions. Training docs have a very 
real effect, however, upon the individual’s performance on specific 
mental tests. And this is ol prime importance since all our observations 
regarding the subject^ psychological make-up arc ultimately derived 
from such concrete behavior. 

THE PROBLl M OF PRACTICE AND INDIVIDUAL 

Dll FLRLNCI S 

Since it has been demonstrated that training can bring about a pro- 
nounced change in mental test performance, a further question may be 
raised regarding the differential effects of such training upon individual 
subjects. Will the initially better individuals benefit more than the 
initially poorer? Will subiects tend to maintain the same relative stand- 
ing in the course of training? Do individual differences increase or 
decrease with practice? 11 these questions are still unanswergl, it is 
not for dearth of data, lor they ha\c been repeatedly investigated with 
a wide variety of materials, methods, and subjects.*’ The entire problem 
is so beset with technical difficulties, however, as to ha\e c\en been 
declared insoluble by some. The crux of the matter is that entirely op- 
posite conclusions can be diawn if the results arc expressed in differ- 
ent forms, a fact which has cast an aura of artificiality over all the 
data. 

In the present section, we shall examine briefly some of the major 
issues involved in the prc>blem of practice and variability. These must 
be considered before any attempt can be made to interpret particular 
findings. The data arc meaningless unless evaluated in terms of the 
specific questions which we wish to answ^cr and the methodology neces- 
sitated by suen questions. This section may seem somewhat of a tech- 
nical digression, but it cannot be eliminated from any analysis of the 
effects of practice upon individual differences. Attempts to present 
only a simplified summary of results have proved exceedingly mis- 

^’For summaries ot the relevant litctaluie, the rearier is referred to Kin 'aid (IS), 
Pcicison and Uailuw Reed (17), Anasiasi (2), Bums (?), and Yoshioka 

and Jones (25). 
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leading, since the reviewer in such cases must cither arbitrarily omit 
many of the data or offer conflicting conclusions with no possibility of 
reconciling them. 

Many of the difflcullies met in this problem are inherent in any 
comparison of variability, either from trait to trait (cf. Ch. 3) or from 
one condition to another. As is true in all these cases, if a solution is 
to be found it must be stated in terms of a specifically defined situation. 
Much of the controversy and confusion seems to have arisen from the 
attempt to go beyond the concretely established lads and discuss a 
sort of disembodied abstract "‘variability'’ which is expected to be 
independent of the particular situation in which it has neen measured. 

In any anaKsis of the cftcct of practice upon individual differences, 
it IS necessary to ascertain at the outset what is meant by equal prac- 
iice. If all individuals arc permitted to practice for an equal period of 
time, the slow'cr worf er will be at a disadvantage since he will have 
received practice on less material than the taster individual. The use 
of an equal amount of material, on the other hand, places a handicap 
on the faster worker, who will necessarily have spent less time in 
learning the material than the slow'cr person. The amouni limit method, 
gi\ing the advantage to the initially pooler individual, favors a decrease 
in variability with practice, whereas the time limit method favors an 
increase. 

Each method answers a somewhat different question. The best cri- 
terion for choosing between the tw'O seems to be a practical one. Equal 
training, as the term is used in cveiyday life, usually refers to equal 
time spent in training. When a person takes a "‘course” in music, or 
golf, or wSpanish conversation, he is given a specified number of les- 
sons, each of the same duration. No adjustment is made for the fact 
that during that period the number times a piano key is touched or 
a golf ball is hit, or the number of words spoken differs widely from 
one individual to another. Hie time limit method would thus appeal’ 
to be preferable, but either may of course be used. The important point 
is to take cognizance of which method was used, when interpreting 
the results. 

A second pioblcm which confronts a-> is the choice of a measure of 
progress to be employed. In Table 6 are illustrated three alternative 
ways of reporting the same scores obtained by two subjects, A and 
with the time limit method. In this table, A represents an initially 
faster worker and B an initially slower one. It will be noted that the 
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relation between the gains of the initially better and poorer subjects 
differs with each type of measure employed. When the scores are ex- 
pressed as amount of work done per unit of time, the gain of the 
better subject appears larger than that of the poorer one. 1 his will tend 
to make variability increase with practice. If, on the other hand, these 
same scores are expressed as time per unit of work, the slower indi- 
vidual will seem to gam more. 


TABLE 6 Vanous Wa\s of L\p)e\sinq the I tlccU of Pnictice 
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This apparent conliadiction becomes intelligible if ^ve realize just 
what lime and amount scores arc measuring Since the slow worker 
requires more time on each item, fot every acldiiionai item nhich he 
completes in the Uitci strntes of piactke, he mil he saMm^ much moie 
time than the iasti / it oikei. Thus, if it took the slow worker, B, 5 sec- 
onds to complete one item at the bcgmnmg of practice and if he can 
complete 8 more items aftei practice than he could before, he has 
gained the equivalent of 8 ^ 5 or 40 seconds pe»* trial. The faster 
worker, A, on the other hand, added 10 items to his score, but he only 
required 3 seconds per item at the outset, so he has gamed 10 X 3 or 
30 seconds (cf. method 3, Table 6). The gain in time per item 
(method 2, Table 6) favors the slower worker even further, since it 
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does not take into account the fact that during any one trial this unit 
gain in speed is manifested more often by the faster than by the slower 
worker, the former completing more items. 

It is apparent, then, that the problem of practice and variability 
must be further defined in terms of the measure of progress employed. 
If a choice is to be made among the various measures, amount scores 
will prove more serviceable because ol their wider applicability. In a 
'‘speed” test, amount scores can be employed interchangeably with 
time scores. In a ‘‘power” lest, however, in w^hich the items are arranged 
in an order of progiessively increasing dillkulty, a time score would be 
meaningless. If, for example, in a 30-minute test consisting of 10 prob- 
lems, all the subjects attempt all the problems but the number of cor- 
rect solutions ranges fiom 1 to 10, it would be absurd to report that 
the average time per problem tanged from 30 minutes to 3 minutes. 
The better subjects did not necessarily work anv faster than the poorer 
subjects, since all membcis of the group tackled all the problems. 

A third problem pertains to the meqiiaUix of units in different parts 
of the scale. In many of the tests in which the items arc arranged in 
increasing order of difficulty, the successive items do not progress by 
equal increments of difficulty. Frequently there arc larger “gaps” be- 
tween adjacent items at the extremes of the scale than there arc 
between items near the center. Or theic mav be a relatne scarcity of 
items at the upper end only, or at the lower end only. Such an unequal 
distribution of items would affect the meaning ol dillcrences in total 
scores at diflerent parts of the range. 

Let us assume, for example, that in a particular test the successive 
items are closer together in fiffn^ulty at the low end of the scale and 
farther apart at the upper end. An individual at the low end of the 
scale who obtained, let us say, an ‘ litial score of 16 items correct 
might very easily raise Ins score to 2 1- in the course of practice, thus 
apparently gaining 8 points. Another individual, near the upper end 
of the scale, who began wath a score of 35, might achieve a final score 
of 40, thus gaining only 5 points. In this illustration, the initially 
poorer person makes a larger gam in raw scoie than the initially supe- 
rior performer. If this occurred con^kimtly, individual differences in 
raw score would decrease with practice, the members of the group 
being more closely alike after practice than before. In terms of equal- 
unit scores, hov/ever, the individuals at the low end of the distribution 
may have been improving much less than those at the upper end, since 
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each raw-score unit at the low end of the scale corresponded to a 
smaller ability difference than did a raw-score unit at the upper end. 
An 8-point gain from an initial score of 24 might actually represent 
less improvement than a 5-point gain from a score of 35. Inequality 
of units might thus lead to a completely erroneous conclusion regard- 
ing the effect of practice upon individual difierences. 

1 he changes in work niethod which often occur in different stages 
of practice are very likely to affect the relative distance between suc- 
cessive score units. If, for example, progress beyond a certain score 
requires a more complex organization of simple activities, or the de- 
velopment of a more efiicient procedure, then score units at this point 
probably represent larger steps in a scale of difficulty level. Shifts in 
size of raw-score units may also occur in tasks in which a ‘‘plnsiologi- 
cal limit” is rapidl> approached. This is often true in motor tasks and 
in man> tasks in winch speed is of primary irnportance. In such cases, 
ph> siological or structurally imposed limitations may make progress 
beyond a certain point impossible. As this point is approachcxl, it be- 
comes increasingly ditlicull to improve one's score; the successhc score 
units thus coirespond to progiessively larger difierences in diiliculty 
lc\el. The same ellect occuis when progress is artificially limited by 
the test ceiling. If this ceiling is too low for the sub|eets being^tesied, 
it will ha\e the cflccl ol artilicially leducing individual dilleienccs in 
the course ol practice, since everyone's progress is arbitiarily cut short 
at a relativcK low lc\cl, allhouLh a number of individuals could have 
advanced much farther, 

Pinally, a fourth consideration concerns the use of relative or abso- 
lute /neasiaes \an\ihibt\ in analv/iug practice data. When absolule 
measures arc used, such as the standaid dev ialion, or gross gains made 
by inilialK high and luw individuais or groups, variability lends to 
in( reuse ith practice. When, on the other hand, relative measures are 
emplo>ed, such as the eocfhcicnt of rehilivc variability,' or some meas- 
ure based upon relative or percentage gains, then vaiiabdity decreases 
with practice in most cases. I he fundameiitai objection against the use 
of relative measures has already been discussed in a picvious chapter 
(cl. Ch. 3). It was there demonstrated that, since scores on most 
cunent psychological tests arc not measured from an absolule zero 
point of performance, any ratios or quotients computed with such 

• 100 SD 
Avcrjge 
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scores may be entirely misleading; the addition of a few easy items at 
the lower end of the scale might completely reverse the relationship 
between the obtained values. 

Thus it would seem that absolute measures of variability arc pref- 
erable for a purely negative reason, if for no other. Since relative meas- 
ures are ruled out by the use of aibitrary starting points in the tests, 
no alternative is Iclt. We may, however, inquire more directly into the 
logic of using absolute or relative measures in practice experiments. 
Ihc argument in support of relative measures is that, since the numeri- 
cal size of scores changes in the course ol praciice, the scores aic not 
expressed in the same units throughout and hence absolute measures 
will not be comparable from trial to trial. Through chance alone, the 
argument runs, absolute variability will inn ease \\hcn the size of scores 
increases and decrease when the scores decrease, such changes being 
therefore of the nature of a statistical artifact. 

It is perfectly true that, other things being equal, numerically larger 
scores will exhibit greater variability. Obviously, if the standard devia- 
tion of a distribution of time scores is 10 minutes, the standard devia- 
tion ol the same scores expressed in seconds will be 6t)0. For the same 
reason, the standard deviation of the number of A’s cancelled in one 
minute cannot be compared directly with that of the number of addi- 
tion^ performed during an equal period, since the latter scores would 
be much smaller.^ 

This type of argument does not necessarily hold, however, when the 
same test is given to different groups or to the same group under dif- 
ferent conditions, such as before and after practice. Let us suppose 
that the average score of a certain group 1 on an intelligence test is 
25 points and that of group II, 50 points. It does not necessarily follow 
from this difference in averages that g. )up II will have a larger stand- 
ard deviation than group 1. In fact, the opposite might very likely be 
the case. If group I, for example, consisted of unseiccted third grade 
public school children and group II of superior sixth grade children 
in a private school, the latter would probably have a lower standard 
deviation. Similarly, one could assemble without too much difficulty 
two groups of men whose average u .s. hts were 64 and 72 inches 
respectively, but whose standard deviations were 8 in both cases. It 
would be quite absurd to insist that the taller group is “actually” less 
variable in height than the shorter, or to suggest that the inches used 
Cf. rh. 3 for a fuller discussion 
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in measuring height had in some mysterious fashion changed in value 
from group I to group II. It would seem that the comparison of varia- 
bility before and after practice is more similar in principle to the above 
examples than to the measurement of variability in different tests. The 
use of absolute measures in this connection is therefore justifiable. 

TYPICAL rXPFRIMENIAL UNDINGS ON PRACTICF 
AND \AR1AHIIITY 

The theoretical analysis given in the preceding section would lead us 
to expect opposite results when different techniques arc employed to 
measure the effects of practice upon individual differences. The results 
actually obtained completely confiim tlic various theoretical expecta- 
tions which have been outlined. Thus time or error scores show a de- 
crease in absolute variability with practice (ct. 25); amount scores 
show an increase (cf. 2, 19, 20). Relative variability, i.e., the extent 
of individual differences expressed in relation to the level of perform- 
ance at different stages of practice, nearly always decreases with 
practice (cf. 17, 19, 20). This simply means that the increase in indi- 
vidual differences with practice is not proportional to the increase in 
level of performance. 

When the question of practice and variability, as studied by differ- 
ent investigators, is reformulated in comparable terms, the discrep- 
ancies among the lesulls of different investigations disappear fcf. 2, 
17). A meaningful answer to this question can be given, if the question 
is stated in specific terms. 

An investigation by Anastasi (2) illustrates the procedure and find- 
ings of studies on practice and variability. For the reasons given in the 
preceding section, it was decided to define equal practice as equal tune 
spent in practice and to express scores in terms of amount clone per 
unit time. The scores on each trial of each of the tests were transmuted 
into an equal-unit scale of difficulty. The extent of individual differ- 
ences at different stages of practice w^as measured by the standard 
deviation, a measure of absolute variability. 

Four groups, each comprising from 1 1 4 to 200 college students, 
were given continuous practice in one of four tests. The tests included 
A-Cancellation, Hidden Words,® Symbol-Digit Code Learning, and 

^‘Subjects were to underline all four-letter English words which were “hidden” 
in a page of pied type. 
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Vocabulary Learning The practice consisted of 15 4-minute trials 
in Hidden Words and 20 2-minutc trials in each of the other tests, a 
different group of subjects being employed for each test. The average 
scores and standard deviations of the scores on each trial are repro- 
duced in Table 7. 


TABIE7 A \Cfaqes and Standard D(\ rations of Scores 
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■“I hi SLOUs on all ihe>L test wtit u m muled into an equal unit stole and are 
thus dirtctly ^.omp ii ibk <roni one ^lsI lo Ihc olhti 


It Will be readily sem that the standard dcv> uons rise with practice 
in e\ery lest. It was also found that ’idividuals tend to maintain the 
same relative standing in the group in the course of practice, the corre- 
lations between initial and scores of the same subjects being con- 

Subjects learned by the method of paiicd tssoci ucs a voc ihul iry of nonsense 
vvlldblcs the test is simihr to c'^>d L irninp but more uitlicult 
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sistently positive and usually high For the four tests, these correlations 
were: 

Cancellation 6725 Vocabuhiy S073 

Symbol Digit 2981 Hidden Woids S2S9 

Such corrclatiotis indicate a tendency tor the individual who is best in 
the group at the outset to remain at the lop after piactice, lor the one 
v\ho is lowest to remain at the bottom, and so on This is commonly 
found to be the case in all experiments on practice 



Fig. 45. 1 timing Curves of Ivvo Siib|cus Illusti iting Divti genet with 

PiattRC (IJnpuhl d it i fiom investigation ot Ana tasi 2) 

Both the tendenc> to n.amtam the same relatne position during 
practice and the increase in absvdule variability arc illustrated giaplii- 
cally in Figure 4s I his shows the leainuii^ ennes ol two subjects on 
the Hidden Words test I he subjects were selected near the extremes 
of the distribution, the difference between their initial scores being very 
large It will be noted that the curves do not at any time cross and that 
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they diverge with practice, the difference between the two individuals 
being much larger on the fifteenth trial than it was on the first.^^ 

IHF STUDY or PRACTlCr AS AN APPROAC II TO THI 
HtREDITY-LNMRONMFNr PROllLLM 

Some writers have seen in the data on praeticc and variability certain 
implications lor the undcrl\mg question of the rLlativc influence of 
hereditary and enviionmental factors it has been argued that if indi- 
vidual differences in performance meuast with practice, they can be 
attributed to hercditar> differences, while if they ihiKcise, they must 
have resulted, at least in large part, from inequ ililics ol past training 
and environmental stimulation Probaoly the first explicit lormulation 
ol this hypothesis was made bv F L Ihorndike in I9()S In m article 
appearing in that year, Ik wrote 

Lvpcnmcnts in practice oflci evidence concerning the relative impor- 
tince ol ongina! n iliirc and tinning in dele mining achievement In so 
I ir as the ciiflercnce> among t individu ds in the rbility at the vtirt of the 
experiment aic due to diffcienees o( tiaining the> should be icdiieed by 
fuithcr tiaining given m cc,nil me «suic to all indniduals It on the con- 
ti iiV, in spite ol ec|uil truining the dineicnec** amongst individuals ic- 
main is luge a** evei the\ aie to be attributed to cliffeicnees in oiigmal 
cap leity (2^, pp ^cS4) 

More rccentlv, Ihorndike (24) conducted an analysis which repre- 
sents a pecdic apphe 'ticm ol this hypothcMS 4 lie data consisted of 
the scoics obi lined b) sevt 1 ^ mps ol liigli school and college stu- 
dents on Cooperative lest Sci vac examinations in f nglish, Latin, and 
several niodcrii Itimmagcs J he nuirbei oi students in each group 
varied from ^0 to 21(y^ I oi the pmp )ses of slu h ai’ analvsis, Fhorn- 
dikc consideicd the tunc pent in le umng each c4 these three subjects 
as the contiibution ol environment to achievement on the correspond- 
ing examinations Similailv he re gilded individual dincienccs in score 
within the groups which h ul had the same Icii flh ol training as being 
the result of heredity He then foural the AD ^ ol the combined 

^M)iUeitnt relitions mi bi obtunel if inclividii ils iic il (JiTliuU slices of the 
pi Ktict cLiiVL It ihc St atint point t il Uiey hive hid diffeiint^ tini )urS of relevant 
priLtiLC pnoi to t il I 

“ Avli i^e dcvi ilion beinq; Ihc ivci ict oi tht differcnecs bclwfin e ich indi 
\idu ds score and Ihe iMOiip me in this js a sin plei and vrudcj mLaMiie ol variability 
than the SD 
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groups, throwing together those who had received difTcrent lengths of 
training. This AD would obviously reflect the combined inlluence of 
heredity and environment. From this value he deducted the AD of the 
specific training groups, taken separately. Since the total AD was re- 
duced by only about 25^^, on the whole, by this subtraction, Thorn- 
dike concluded that this per cent represented the net contribution of 
environment to individual differences in the scores on these evainina- 
tions. The major inlluenee he thus attributed to hereditary factors. 

A critical evaluation of this study is given by Flaniilton (13), in 
terms of both statistical methodology and hiterpretation. First it is 
demonstrated mathematical!} that the AD does not lend itself to the 
analysis by subtraction which Thorndike employed. The residual AD 
docs not correspond maLhemalically to the contributii)n of practice, or 
length of training, as T homelike had assumed. Secondly, Hamilton 
calls attention to the fallacy of altiibuting to heredity all individual 
difTcrences which remain when length of training is constant. Such an 
assumption would ignore all the dillciences in the students’ prexious 
experience which might result in dillerences in motivation, study 
habits, previously acquired inlormation and skills, and the like. The 
student'.’ performance in the courses, and conscLiucntlv in the exami- 
nations, would obviously be allected by such antecedent environmenlal 
lacTors. To this should be added the well knowm fact that ngistralion 
in the same course does not signtiv the ^ame amount of time spent in 
learning the sub|ect on the part ol diflercnt students! 

In a stud} designed to avoid the piUalis discussed above, FTamiltc n 
(13) gave groujis of from 22 to 2S filth grade school children 20 
Inals ot each oi three learning tasks, vi/., ailificial language, s^mbol- 
dieit substitution, and “making gates.” By more refined statistical 
anah ses,’ ’ Hamilton dcnuMistraled that the amount ol practice, 1 e., 
the ruimbcr of ti lals wha h the subjects had had at any one stage during 
the experiment, v\,u a mu^e potent determiner achievement than 
T homelike ’s leulit^ had suggested, bor example, when jvrlomianee on 
trials 1 and 2 w is compared with peilormance on trials 10 and 20, the 
“practice eHeet'^ hu exceeded the combined eHcLt ol all other factors 
causing individual didjunees. Table 8 shows the leiative contributions 
of present piaeticc on the one hand, and ol other, residual factors on 
the other, in tins particular comparison. 

^'Analysis of vaii.mc<* aid intraLlass tDiMlat.ori I he \aiiaiicc of a distnbiitioa 
IS the avciage of ilic squaicd aillLrenL%.s ol .-.leh seoic fiom ihc gioup mean, i.e., SD*-. 
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TABLE 8 Analysis of the Conuibution of Pt active to Individual 
Differences 


* rifiiu JJ iiiulton n, j>{( 
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71.43 

28 57 


* hat the conclusions reached by 1 liurndike in the earlier study were 
in p.irt the result ol faulty statistieal methodology was demonstrated 
when Hamilton recomputed some oi TIu)indike‘> ligurcs by the present 
method. Hamilton found that the per cent ol the variability attributable 
to length of training was much greater than had been obtained in 
dhorndike’s anal\sis. She concluded not only that practice plays a 
•iiuch greater role in determining indnidual dillerences m achieve- 
ment than had been suggested by Thorndike, but also that no single 
estimate of its relative conti'butum can be gi\en. She points out that 
the proportional contribution of practice depends upon; ( 1 ) the s^age 
of the learning curve at which individual diflerciiees are measured; 
(2) the amount of practice which intervenes between the trials being 
compared; (3) the heterogeneity ol the grour')s in regard to other rele- 
vant characteristics; and (4 i the Lind of task or skill under consid- 
eration. 

The second and third of the conditions cited by Hamilton will be 
reeogni/ed as a specific application ol omc of the points implicit in the 
concept ot inletiu discussed in Chapter 4 in conr.ection with 

the general problem of heredity and environment. It was thei'e pointed 
out that estimates of propoitional contribution are inconsistent with 
our knowledge of the operation of hereditar\ and environmental fac- 
tors. A different estimate will be obtained in groups which vary in 
their environmental or hereditary ^ ^geneity. In a group having 
highly similar environment, hereditary factors would have a larger 
weight in determining individual dillerences. Conversely, in a group 
highly similar in heredity, environmental factors would exert a rela- 
tively greater effect in the development of individual differences. It 
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shouJd be noted that the same relationship holds when determining the 
relative weights of two difterent environmental factors or two dilferent 
hereditary factors. Thus in the above experiment, if we wish to com- 
pare the relative contribution of immediate practice, i.e., number of 
trials or length of training, with the contribution of antecedent factors 
(environmental and hereditary), the estimate would vary as cither 
present practice or prior conditions vary. For example, in comparing 
individuals all of whom have had exactl} the same number of practice 
trials, the contribution of immediate practice would obviously be zero. 
Similarly, it the scores on trials 15 and 16 are compared, the role of 
practice in producing score diflerenccs will appear to be relatively 
small. When, on the other hand, we compare groups which differ 
markedly m the number of trials which they have had, then the role 
of practice in individual differences will be large. If we arc interested 
in discovering how lar practice may account for individual differences, 
then we should obviously give practice the opportunity to operate, by 
comparing individuals who differ conspicuously in amount of piaclice. 

In concluding the present section, one other implication oi piactice 
and variability studies ought to be examined. It is sometimes argued 
that when s ib|ects undergo a prolonged period of equal training, the 
differences in their past experience with the given task are thereby 
wiped out. This assertion is open to question. \ he inlluencc of environ- 
mental factors upon the development of the individu.il is ordinarily 
cunnilaiixe. It one individual^ past experience has made him more 
prohcient than .mother m a ceitam lask, wc should expect him to be 
belter hlted to profit from instruction lor that very reason. Suscepti- 
bility to tiaiiiing can itself be envuonment.illy ciclcrmincd, and if so 
determined there is no reason to assume that it will disappear wuth 
additional training. 

The individual who h.as been handicapped by a “poor’" environment 
may lack the nccessaiy intellectual toids to pioht from instruction. 
Thus, had the Wild Ifoy x\ve\ron (cf. Ch. 6) and a boy of the same 
age from a middle-class 1 nghsh hornv been put through an identical 
one-year course in the reading of French, the differences in their abili- 
ties to read that language would have been lar greater at the end of the 
course than at the beginning. Similarly, investigations on the acquisi- 
tion of motor skills (cf. 14) have demonstrated that individuals who 
have been taught more efficient work methods not only have a head 
start, but with continued practice gain progressively more than others 
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using less efficient methods. It is obviously unnecessary to assume a 
hereditary basis lor individual diflercnces m order to account tor the 
increase in variabilil> in such examples The more the individual has 
learned iii the past, the more he will be ible to learn in the present To 
use a rather crude analog}^ we might say that practice does not add 
to the individual s ability, but multiplies it 
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Schooling and 
Intelligence 

A iivEiY CONTROVERSY has Centered about the efTccts of schooling 
upon intelligence test performance. I he divergent conclusions reached 
by dilTcrcnt investigators have resulted at least ’n part from inadequate 
clarilication of undei lying concepts — a tact which has led to prolonged 
critiques, replies, rejoinders, and counlci rejoinders in the journals, 
with the participants being no closer at the end than they were at the 
outset. Scoies of investigations have been conducted, some emphasiz- 
ing and others minimizing the role ot schooling m the development of 
intelligence. When the mass of available d<ita is sifted, no very star- 
tling discovery regarding heredity and environment emerges. As an cx- 
pcMinental approach to the problem of heredity and environment, the 
study of the effects of schooling leaves much to be desired. Some of 
its limitations will he considered in the anal>sis of the theoretical 
implications e^f this method, to be given in the concluding sections of 
the present chapter. 

If these studies have contributed little to the sum total of our knowl- 
edge regarding the factors operative in behavior development, they 
have nevertheless indirectly stimulated a thi)roughgoing evaluation of 
practices commonl)/ followed in mental test studies. In the course of 
the controversy, attention has been focused upcm the methodological 
requirements of such investigations. Needed cautions in the interpreta- 
tion of statistical data have been clearly expe unded, and rigid stand- 
ards for the control of conditions ha» ' ''cen set forth. In looking over 
the critical literature concerning the studies on "‘schooling and the 
IQ,” one cannot escape the impression that higher standards were 
demanded than had heretofore been commonly applied in most mental 
test studies, on any topic In their zeal to counteract a too sweeping 
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generalization or a premature publicizing of results, the critics some- 
times outdid themselves. The net outcome, however, has been a posi- 
tive contribution to the development of sound methodology in mental 
test studies and an increasing awareness by investigators in this area 
of the need foi experimental controls and careful evaluation of statis- 
tical findings. 

At the core of the controversy arc a group of studies dealing with 
the eflects of nursery school attendance upon the 10. It is with this 
pfescliool level that the largest number of investigations and much of 
the discussion have dealt. Nevertheless, c number of studies which 
followed essentially the same procedure have been conducted at higher 
educational levels, from the elementary school through college, and 
these will also be included in the present chapter. A few studies con- 
cerned with the influence of special educational techniques, ^uch as 
specialized courses ol training or specially designed curricula, will also 
be considered. I he latter studies are somewhat related in general 
approach to the coaching studies covered in the preceding chapter. 
They dilTer, howevei, in that the training is much farthci removed 
from the actual test content and was not designed with reference to the 
test 

In\esiigatioiis on the efTccts of schooling also have cerlaTn features 
in common with studies on the effects of various institutional emuon- 
ments. The latter will be discussed in Chapter 1 1, in conjunction with 
the invcstig.ition of foster home environment. All the studies treated 
in the present chapter deal specific illy witJi schooling or training, as 
distinmiished from the more gener<il factors operative in the “home” 
or “living’ environment ol the individual. 

THI llircfoi SPT ClAl l Dl CATION \L TIC UNJQUl S 

A lew investigator*^ have been interested in the possibility of raiding 
the intellectual performance level of dull or feebleminded subjects by 
means of specially designed, intensive programs ol training. In a study 
by Kephart (25, 26), 16 boys living in a single cottage in a training 
school were given special instruction for a period ranging from six 
months to nearly three years in individual cases. The aim of the pro- 
gram, according to the author, was to stimulate constructive activity 
and to encourage ingenuity, initiative, and original planning. Concrete 
materials, social situations, and abstract problems were included in the 
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training. Among the latter were problems involving the recognition of 
absurdities in stories, a task which has much in common with some of 
the Stanford-Binet tests. At the beginning of the experiment, the age 
of the group ranged from 15 to 18, with an average of 16-6 (i.e., 16 
years and 6 months). Initial Stanford-Binet 10 averaged 66.3 and 
ranged from 48 to 80. At the end of the experimental period, the 
mean IQ had risen to 76.4. Individual gains ranged from 2 to 22 IQ 
points. All subjects gained upon retesting except one, who lost 3 
points. Half of the group gained 10 or more points; one fourth gained 
15 or more. Control data were obtained within the same group, as 
well as in an equated group. I'csts given to the experimental group 
over an eight-month period prior to the initiation of the special train- 
ing program showed a mean rise of only 2.3 IQ points, the individual 
changes ranging from —7 to -f 18. A control group of 26 boys of 
the same age and in the same institution as the experimental group 
was tested over the interval in which the experimental group partici- 
pated in the training program, 'fhe mean gain of this control group 
on the retest was 1.9, with a range from —10 to -t-15. 

An extended project on the education of mentally retarded chil- 
dren reported by Schmidt (45) luis aroused a storm of controversy. 
In this study, 254 hoys and girls between the ages of 12 and 14, who 
had been referred to special classes, were put through a three-year 
educational program especially designed for them. The average initial 
Stanford-Binet IQ reported for this group is 52.1, with a range from 
?7 to 69. The subjccls were tested periodically with intelligence, edu- 
cational achievement, and personality tests during the three-year train- 
ing period, as well as during a hve-ycar i‘ollow-up aftei the completion 
of the experimental program. 1 he degree of progress in all aspects of 
behavior reported in this study far tstrips that found in any other 
investigation to date.^ At the completion of the investigation, a mean 

^ In view of' its scope, clntation, rind wealth of obscrvatuin'^. this study deserves 
sciioiis eonsideration Ar the same rime, the mvestmaioi hoi seif calls attcirtion to the 
wide divert’ciice of these icsults lioni coincniiona! juofessKmai opimon and points 
out the need for independent veimcation A cntical analysis of this siud>. lojzether 
with suggested reasons for its discrepant results, will he included in the general 
evaluation at the conclusion of the piescnt sect ion. 

An advejse ciitujue by S A. Kirk, togethe. ' :h a reply by Schmidt, appeared in 
Ps}ch()l. Hull., 1948. 4.^, 321-343. Since the icviewci did not have access to all of 
Schmidt’s cases, and since bis sources of data show ccitain mieinal incimsistencies, it 
IS difficult to draw any contlvision lioin this exchange of comments At this stage the 
sale4 conclusion is that tfe study offers valuable leads for further reseaich. The 
reader should also consult the cntical review by F. L, Goodenough in /. Ahu. Soc, 
Ps^ihol, 1949, 44, 135 -140. 
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gain of 40.7 IQ points was observed; 80.7% of the subjects made gains 
of 30 or more IQ points and 59.6% gained 40 points or more. The 
larger part of these gains occurred during the three-year experimental 
period, although in the course of the subsequent five-year follow-up 
the IQ\. showed continued gains, rather than dropping toward the 
initial level. 

The progress in educational achievement reported by Schmidt is 
equally remarkable. Although the average educational performance 
at the beginning of the experiment fell within the first grade, by the 
completion of the three-year program it had reached approximately 
fifth grade level. Moreover, 79 subjects transTcrred to the regular ele- 
mentary school cither to qualify for immediate graduation Irom the 
eighth grade or to complete the elementary school course in regular 
classes. During the five-year follow-up period, a large number con- 
tinued their education in technical, business, or avocational courses, 
and 27 of the original group had graduated from high school by the 
termination of the study. Data on subsequent occupational history, 
socio-economic status, community activities, and the like during the 
follow-up showed the group to have made a very satisfactory adjiiNt- 
ment. 

As a control group, Schmidt employed 68 children, also enrolled 
in ''pccial classes for tlic inlellectualiy deficient but not participating 
in the experimental program. The control group was approximately 
equated with an experimental sub-group of 64 cases in initial IQ, edu- 
cational achievc!iieni, and chrcmological age. The mean gain of this 
experimental sub-group was 23.8 IQ points, while the control group 
lost an axerage of 3.6 points during the same period. Marked differ- 
encc'^ in educational progre .s and in subsequent vocali(mal and social 
adjustment were likewise found between these tw'o groups. 

Both of the above studies suggest that special training may exert 
considerable infiuence upon intellectual development, fhe reverse con- 
clusion was reached by two other studies conducted on dull-normal 
and normal subjects. In one of lhc.>c (42), ] 1 1 dull-normal children 
were gix^m the Stanford-Binet before and after a two-year period in a 
school oTcring an ‘‘expcricrcc curnculum.'’ The authors report that 
the CLiiTiculuni was especially planned to stimulate intellectual activity 
among slow^ learners at this ability level. Pupil interest is described as 
very high and truancy was virtually eliminated during this program. 
It was originally planned to admit to this cx^ursc only children wdth 
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lO’s between 75 and 90, but the sampling actually included 10 cases 
between 60 and 74, and 6 between 96 and 104; the latter were ad- 
mitted because they represented decided school failures, despite their 
near-normal IQ’s. The initial mean IQ of the sampling tested was 
85.12. Initial age ranged from 5-8 to 12-3. The mean change in JQ 
during the experimental period was slight and similar, in both direction 
and amount, to retest changes found by Tcrman and others when the 
Stanford-Binct is readministcred with no special interpolated ex- 
perience. 

Similarly, the Stanford-Binet fO’s of 141 children who completed 
the fourth grade of a demonstration school were not increased by par- 
ticipation in what is described as a “rich and vital school curriculum” 
(28). The mean initial IQ of this group was 109, with a range from 
89.0 to 134.6. Fifty-one of these children, who were retested over a 
four-year period, showed a mean 10 loss of 1.53 points. y\ group of 
74 retested after throe years gained an average of 1.48 points, and 89 
children retested within two years made an average gain of .06. None 
of these differences is statistically significant. 

A somewhat different approach is illustrated by an experiment (54) 
in which 30 college sophomores were given six weeks of training in 
general semantic methods. Scores on the Detroit Intelligence Test, 
Advanced Form, rose an average of 36 points during this period, in the 
experimental group. The control group, which had received no such 
training, showed an average rise of 6 points. In terms of the national 
norms for this test, the gain made b\ the experimental group repre- 
sents a rise from the 62nd to the 96th percentile. 

Mention may also be rr dc of the rcMilts achieved with the army’s 
Special Training l^nits during World War 11 (6). Tiirough an intensh'c 
12-week course of instruction in these units, men who had been illit- 
erate or of very limited education vere brought approximately to the 
fourth grade elementary school standard in reading, language expres- 
sion, and arithmetic. At the same time, their performance on the 
AGCT rose from Grade V, the lowest army grade, to Grade IV or 
even higher. Approximately 85^ of the men selected for such training 
succeeded in reaching such standards Had the initial classification of 
these men been regarded as an inoex of their “native intellectual 
capacity,” without reference to their educational and other experiential 
limitations, the possibility of “raising” them to Grade IV level would 
have been overlooked. 
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In evaluating any of these studies, a fundamental question is: How 
broad or how narrow was the effect of the particular training which 
was furnished? It is not at all surprising that the results should differ 
with the nature of the training, the degree of similarity between the 
trained functions and the functions sampled by the tests, and possibly 
the initial intellectual level of the subjects. 

In connection with the positive findings reported by some of these 
studies, one may ask to what extent the improvement was limited to 
functions closely similar to the tests, and to what extent other intel- 
lectual behavior had also improved. In so far as the educational 
achievement of the subjects in the Schmidt study also showed marked 
gains, and in the light of the subsequent vocational and social adjust- 
ment of this group, the area of improvement appears to be consider- 
ably broader than that of the test. This is also true of the results 
obtained in the Army Special Training Units. It would be misleading, 
how'ever, to assume that the rate of development of all behavior func- 
tions had been accelerated by such training, or to speak of improve- 
ment in some mysterious ‘‘underlying capacity.’' What is affected is 
observable behavior, and the breadth of behavior so influenced can be 
empirically determined for each type of training. 

Applying the same analysis to the two studies which yielded nega- 
tive results, we find, first, that such training procedures as are sub- 
sumed under the “experience curricula” and the “rich, vital curricula” 
of progressive education seem not to affect appreciably the 10 of most 
normal or borderline children. Taken as it stands, this finding is not 
too unexpected. Such curricula generally emphasize interest, individual 
initiative, practical applications, and a number of similar features 
which mav help the general adjustment and achievement of the indi- 
vidtiaL both in school and out. I3ut such instruction is not oriented 
tow^ard improving the type of behavior funcliv^ns wliich are predomi- 
nantly sampled by most intelligence tests. Among the latter functions 
we may note, for example, abstract verbal and numerical ability, mem- 
ory, attention to details, speed of routine work, and following direc- 
tions minutely and without hesitation. Whether other intelligence tests 
should be devised to sample different behavior functions, or whether 
the progressive curricula cover the most desirable functions to be 
developed in any one group, is of course entirely beside the point. 
What is relevant in the present connection is the fact that different 
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curricula or courses may differ widely in their coverage of the type of 
behavior functions sampled by intelligence tests. When the problem is 
viewed in this light, the results of the studies which have been cited, 
although widely divergent, need not be regarded as inconsistent or 
contradictory. 

in the Schmidt study, which reports the most conspicuous effects of 
training, the experimental program was very broad in its coverage. 
Although it, too, was concerned with the stimulation of pupil interest, 
a multiplicity of procedures which might lead to intellectual improve- 
ment were included. For example, the attention given to the develop- 
ment of effective work and study habits and to the attainment of min- 
imum levels of performance in reading and language usage may 
account in part for the continuance of improvement after the termi- 
nation of the experimental period. These skills provided the necessary 
tools for further progress. The care taken to adapt instruction to the 
specific needs and deficiencies of each individual may also have con- 
tributed to the effectiveness of this training. 

On the other hand, it would undoubtedly be rasli to generalize these 
rciiults to all cases of intellectual backwardness. Selective factors prob- 
ably operated to make the particular sampling of this study more sus- 
ceptible to rapid improvement than wcaild be the case among intellec- 
tually retarded subjects in general. The group as a whole came from 
very inferior socio-economic backgrounds, where opportunities for nor- 
mal behavior development were poor — a condition with which the ordi- 
nary elementary school could not adequately cope. There is the further 
likelihood that the educa 'onai and consequent intellectual develop- 
ment of a number of these subjects was initially hampered by sensory 
defects, poor health, and language handicap. All these conditions, 
could be — and probably were — largely remedied in the course of the 
experimental period, which would account for some rapid gains in 
educational performance as well as in IQ. The subjects as a group 
were likewise socially immature and poorly adjusted emotionally at 
the outset. Marked improvement in these respects is reported in the 
course of the experiment. Such improvement would in turn affect in- 
telligence test performance both directly and indirectly: directly 
through greater alertness, interest and cooperation during the test 
itself, and indirectly through an increase in the effectiveness of learn- 
ing in general and in the acquisition of those skills which are sampled 
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by intelligence tests. It is interesting to note in this connection that 
those individuals showing the greatest improvement in emotional and 
social adjustment also tended to show the greatest gains in IQ. Thus 
the correlation between improvement in Stanford-Binet IQ and in 
Bernreuter BIN score (emotional adjustment) was .923, and that 
between IQ gain and gam in the Vineland Social Maturity Quotient 
was .874. 

In the light of these considerations, the results reported by Schmidt 
are probably not so startling as they might appear at first sight. A con- 
clusive evaluation of this study, however, would require more infor- 
mation than is provided by its author. Especially would it be helpful 
to know more about the detailed procedures employed both in the 
training program and in the testing. 1 he presence of a number of 
minor arithmetic errors and inconsistencies in the published results 
also suggests an unfortunate carelessness in reporting data. 

In summary, it is apparent that both the nature of the training and 
the nature of the subjects determine the degree to whicli intellectual 
performance level can be raised by training. Much more research is 
needed to ascertain the relative effectiveness of dilferenl types of train- 
ing for different individuals, as well as the relationship between sub- 
ject characteristics and susceptibility to training in general. 

STUDIFS ON PKI.SC'IIOOI, ATI HNDANCE 

Over fifty investigations have been conducted to determine what effect, 
if any, preschool attendance at a kindergarten or nursery school has 
upon the child's IQ. A few studies (c.g., 8, 34, 35, 48 ) give only the 
intelligence test scores of a nursery school group before and after a 
period of preschool attendance. In such studies it is impossible to 
determine how much of the change in score may result from retesting 
or from the time of the year when the tests are given. A control group 
is essential for this purpose. Another group of studies (e.g., 29, 39, 
40, 41, 56) report only the relative performance in the first grade, or 
at subsequent scholastic levels, of two groups, one of which had 
attended preschool while the other had not. The dilTiculty with this 
procedure is that nursery school attendance may be — and probably is — 
itself selective. Even w4ien the groups are equated in parental educa- 
tion and occupation, as well as in other broad categories, selection may 
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have occurred within these categories. For example, among families 
with the same educational, occupational, and socio-economic level, 
those parents who enroll their children in nursery school may still have 
JifTered in intelligence, personality characteristics, interest in the chil- 
dren, and other subtle and inconspicuous ways. A local type of selec- 
tion may also occur in certain nursery schools which offer special 
remedial services. In such cases, the children with defects of speech, 
personality, etc., arc probably more likely to be sent to the preschool. 
Corroborative evidence for the operation of such selective factors is 
furnished by some of the investigations (cf. 29, 56). 

The most direct analysis of the effect of nursery school attendance 
is based upon the intelligence test scores of a nursery school group 
before and after a period of preschool attendance, together with the 
scores of a matched control group tested and retested over the same 
interval. This is the procedure which has been followed by the major- 
ity of investigators. In some of the studies, successive retesting at 
intervals within the preschool period permits the investigation of cumu- 
lative effects and the determination of the course of the changes 
throughout the pcrirKl. Moreover, a follow-up of the experimental (or 
preschool) group and the control group for several years permits a 
study of the permanence of tlx. effects observed. 

In their interpretations, most investigators have aligned themselves 
Jclinitely on one side or the other of the controversy. Some lay great 
emphasis upon the dilTcrenccs which have been found in favor of the 
nursery groups. Others stress the smallness of such differences and 
their complete absence in some of the groups. The findings vary, to be 
sure, for a variety of reasons to be considered in a later section. But 
the interpretations vary more sharply than do the results. The data 
appear to fall, not into two categones, the pro and the con, but rather 
into a continuum of slightly varying effects, which may be related to 
the conditions of the investigations. 

In a summary of about fifty studies on nursery school children by 
diiferent investigators, Wellman (61 ) reports the results obtained with 
several intelligence tests. The largest number employed some revision 
of the Binet scales (Kuhlmann-Bim i, 1916 or 1937 Stanford-Binet). 
When the results of these studies were combined, the mean gain by 
1537 children in 22 nursery groups was 5.4 10 points; the mean gain 
by 597 control, non-nurscry children in 14 groups was 0.5. Mean gains 
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of over 6 points were reported for 50% of the nursery groups and 14% 
of the non-nursery groups * 

The Mcrrill-Palmer Scale shows a larger influence of nursery school 
attendance, probably because of the greater similarity of its content 
to nursery school activities The mean gam of 267 children m 7 pre- 
school groups on this test was 14 5 10 points, that of 73 non-preschool 
children in 4 groups was 6 7 A mean gam of over 10 points is reported 
for 5 ol the preschool but only one of the non-preschool groups A 
few studies emplovcd a number c^f other intelligence scales, siiJi as 
the Minnesota, Gescll, and California Presciiool Schedules On these 
tests both preschool and control groups tend to gain m me in score 
upon retesting, with no significant or consistent advant igc of the pre- 
school groups 

The comparison of gams made o\er initial and subsequent intcrv ils 
of nursery school attcndincc shows that in ncarlv group the 

increase^ in scoic arc cumulatnc Later gains are however, slight, and 
the evidence stronglv siiggcst> that the major impio\cment m intelli- 
gence test perform ince occurs during the fast fm months of nuisciy 
school tittcndance 

Many of the nursery school studies covered m the Wclln^an sum- 
mary were based upon a small number of cases, an\ studv co\cr ng 
10 or more children h i\ing been included in the sur\e> In a number 
of the studies the conditions of the investigation oi the anil\sis of 
data were such as to m ike c\ iiuation dillicult Among the more ambi- 
tious projects, from the viewpoint of iiumbei of cases, durition of tht. 
obscivations, and number ol fact us invcstugatcd aie those conducted 
at the limvcrsilics of Iowa, Minnesota, and Calilornia. 

Wellniin ind her coll iboiators at the University of Iowa have con- 
ducted in expended senes ot pioj^els on the cfleets of nuistiy sehool 
attend mce upon mlelligcnce tc^t peilormancc (60) The pnncip d d ita 
WCTC derived from a told sunphn^ of childien between the iges ol 
18 and 77 months, who were ittend ng either the nuisery school or kin- 
dergarten conducted by the univeisity \]\ wcic given cither the V)\(^ 

^ The si ttistic il si^nificnn^e of the is is not ilwiys icpoilcd in these stiul t 
On tht b ISIS ot IV nl ibk ciiti the sIik' ts stem lo be vhoiit ctiii illy (IivicIlcI mo 
those which meet he common cult ii of i sieml t mt dilkicncc btlwccn Ima* sooits 
ot nursery mcl nonnursi^iy j.ioiips and those ^^hIch do not Tn the comparison of 
ini lal and fmil mt ms of the m istiy foi ps the difTcitnecs iic dso si^,)ificmt m 
about one h ilf ol the studic ind insienific mt m the other hilf In tht non nursciy 
giOLips however signihc int inipfovcincnt is ncly louiid 

Included m Wcllmin sumini/v as ue those rtpoiltd in 12 24 41 6^ 
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Slanford-Binet or the Kuhlmann-Binet in the fall and again in the spring ^ 
of each year of preschool attendance. The mean difference between fall 
and spring tests during the first year of attendance was a gain of 6.6 IQ 
points; the changes ranged from a gain of over 40 to a loss of over 30 IQ 
points. Slightly over half of the children showed a change of 8 or more 
points. 

Within the total sampling. 228 subjects attended preschool for two 
years or more, 67 of these attending for three years or more. Analysis of 
these sub-groups indicated that the mean score rose during successive 
years of preschool attendance, but the gains became progressively 
smaller. Thus the two-year group showed a mean fall-to-spring rise of 
7.0 and 3.8 points, respectively, during their first and second years. The 
three-year group gained an average of 7.7, 4.3, and 1.7, respectively, from 
tall to spring testing during each successive yeai."' That length of attend- 
ance at nursery school bears little relation to the amount of gain was also 
demonstrated by the absence of signilicant coi relation between gain and 
number of days of actual attendance during the year, the lattei ranging 
from 37 to 148 days for individual children. Moreover, no relationship 
was found between the exact length of time which bad elapsed between 
any one individuals fall and spring testing and his gain in score. 

No relationship was found between amount of change in IQ among the 
nursery school children and the occupational or educational level of their 
parents. It should be noted, however, that the gioup as a whole came from 
superior occupational and educational levels Had the spread of parental 
characteristics and of home environments been wider, some relationship 
might have been found between these chaiaclcrislics and gain in TQ. 

In order to obtain control data for the evaluation of the observed gains 
in score, Wellman compared .^4 pairs of preschool and non-preschool 
children, matched in chionological age and initial IQ. Between the fall 
and spring testing, this preschool group gained an average of 7.0 points, 
while the control group lost *in avera-e of 3.9 points. The mean difference 
of almost 1 1 points between the two groups on the spring test was statis- 
tically significant. 

Follow-up stiuJics in elementary school, high school, and college have 
also been conducted on nursery and non-jiursery groups in the Iowa 
project. In a frequently cited study by Kounii, (27), the achievement of 
22 preschool and 31 non-preschool children was compared during the 

The “full” tests were iziven between Auiiust 1 and December ^1, most of them 
oceurnng in Oetobei and November; the “spring” tests extended from March 1 to 
June 30, with the masinuim concentraiion m April and May. 

"‘The net gains o\cr the total period arc smaller than the sum of these annual 
gains, since a slight loss in mean scoic occurred during the summer months The 
“initial” score each fall was thus slightly lower than the “linal” score of the pre- 
ceding spring. 
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first four grades of elementary school. The two groups were approxi- 
mately equal in initial IQ, the mean Binet IQ's for the period prior to pre- 
school attendance being 1 18.3 and 1 17.7 for preschool and non-preschool 
groups, respectively. Achievement test scores in arithmetic showed no 
significant difference between the two groups; in reading achievement tests, 
slight and not very significant differences were found in favor of the pre- 
school group. In school marks, no significant difl'crcnccs were noted during 
the first two years, but significant differences in favor of the preschool group 
appeared during the third and fourth school grades. The delay in the 
appearance of this difference might suggest ^hat the preschool and non- 
preschool samples actually differed in intellectual level (through differ- 
ences in home background or any similar factor other than nursery school 
attendance), but that such differences did not enter into the type of 
behavior sampled by intelligence test^ at the preschool ages. Thus the 
initial equating of mean IQ’s between the two samples would not he suffi- 
cient to rule out other relevant differences, and the later divergence in 
school achievement could not be conclusively attributed to the effect of 
nursery school attendance. The evidence presented by this study is ren- 
dered even more uncertain by the small number of cases involved in some 
of the comparisons, since by the end of the fourth school year the grt)ups 
had shrunk to 10 preschool and 8 non-preschool children. A further dilfi- 
culty is the fact that the groups compaied at the upper gradifs were no 
longer equaled in initial IQ, owing to the selective elimination of cascs.^'* 

In another study, Wellman (58) compared 29 preschool with 29 non- 
preschool subjects who had been matched on initial IQ and years of 
school attendance subsequent to preschool. The intelligence test scores of 
this group during the high school period yielded a negligible and insignifi- 
cant difference in favor of the preschool group. Equally insignificant was 
the difference found during the college period between initially matched 
groups of 19 preschool and 19 non-preschool cases." 7 he high school and 
college groups overlapped, some of the same subjects being included in 
both groups: hence the results of the two comparisons cannot be regarded 
as independently corroborative. One can only conclude from these studies 
that no pi'olongcd effects of nursery school attendance upon cither intelli- 

‘^‘To be sure, the diO'crences in school marks in favor of the preschool jiioup in 
the third and fouifii giiidcs remained vvhen onlv individuals of approximately the 
same IQ’s were compared, bu' this necessilaied a still fuilher leduction in the 
nuniher of cases compared 

I he mean diflerenccs in iniiKtl IQ in favor of the preschool pJ'oups were O.S and 
2.7 for the hijih schoed and coMcjic samples, tcspcclively. I he coriespondiiva dilTei- 
cnces in pciccntilc scores on the American Council I*sycholo^ical Examination and 
the Iowa College Entiancc E,\aminalion were 9 8 and 12.0. d hesc two dilferenccs arc 
1.9 and 1.7 times as large as their respective standard errors, thus being qurte insrgnifi- 
cant. Moreover, the averaging of percentile scores in these data distorts the resiills 
somewhat and makes their interpretation more difficult (ct. Ch. 2). 
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gcncc test performance or school achievement have been satisfactorily 
demonstrated. 

Tn another investigation conducted by the Towa group (46, 62), two 
matched samplings of normal and borderline orpluuwf^e children were 
studied over a three-year period. The experimental group attended a pre- 
school conducted at the orphanage; the control group did not. The two 
groups were matched in initial IQ (on either Kiihlrnann-Binct or 1916 
Slanford-Binet) , CA, sex ratio, length ol residence in the orphanage, 
nutritional status, and presence or absence of sensory defects. The prin- 
cipal finding of this study was that the mean IQ of the control orphanage 
children dropped during their institutional ro'^alcncc. while that of the 
preschool group either lose slightly or showed a negligible change. Pre- 
school attendance in this group seems thus to ha\'c couniciacled the rela- 
tively “unstimulating’' environment of the orphanage. This difference 
hetween preschool and control children was observed at all IQ levels, 
although it was not significant throughout. 1 he decrease in IQ in the non- 
prcschool group was piogressivc with increasing length ot institutional 
residence. Thus no substantial change in IQ ^^as ob-,ervcd in thi*' group 
over the shortest residence period studied (averaging 115 days). At the 
other cxlrcine, children whose stay at the orplianage averaged 642 days 
lost an average of 16.2 IQ points. The cffecl of preschool attendance 
upon the IQ, on the other hand, was manilcstcd early and showed little or 
no suhseqiiein change with continued attendance. 

Among the various criticisms which have been directed against this 
study, one or two are paiticularly relevant to the principal results cited 
above. Jl has been pointed out (15, 36), for example, that adequate 
matching of the picschool and non-preschool guoups was not sustained 
throughout the cxpeiimcinal period, hecaiisc of the removal of children 
for iostcr liome placement and their replacement in the experimental 
groups bv substitutes. As a result, the 47 ciiildieii who at one time or 
another were in the preschool gioep aveiag;cd 3.4 ]Q points higher than 
the 44 childien in th>' contu>l group. The preschool gnaip had a ms'an 
initial IQ of 86.6, witn a range from 65 to 163; the initial IQ's of the 
control group averaged S3. 5 and langed liom 57 to 1 14. 

In a re-analvsis of the data (62), it was demonstrated that among the 
children watli over 400 days of orphanage residence, those who had 
attended preschool differed significa’Uh in final but not in initia' TQ. Within 
the same rcMdencc group, the presaiool children w'ho had actually at- 
tended the preschool for les^ than half ot the total r umber of days did not 
dilTcr significantly in IQ irom the control group at the end of the observa- 
tion peiiod. Those who had attended the preschool for more than half of 
the period, on the other hand, showed a clearly significant ditTcrcnce 
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from the control subjects. In the last-mentioned comparison, the pre- 
school subjects gained on the average 6.8 IQ points, while the control lost 
5.1 points. This analysis suggests that the preschool training did actually 
serve to raise the children’s intelligence test performance. Such a conclu- 
sion, however, requires further corroboration because of the small num- 
ber of cases involved in any one of the specific comparisons made. A 
closer matching of the control and preschool groups in initial IQ and age 
would also permit a more precise interpretation of the observed gains and 
losses. It should also be noted that certain other conclusions drawn from 
this study by the investigators are open to serious question and will he 
considered in connection with methodological problems in a subsequent 
section. 

One of the most carefully controlled studies on the effects of nursery 
school attendance upon IQ is that conducted by Goodenough and Maurer 
(18) at the University of Minnesota. A total of 147 children who had 
attended nursery school from 40 to 575 days w'crc compared with 260 
children having no nursery school training. All subjects had been tested 
at least twice with either the Minnesota Preschool Scale or the vStanford- 
Biriet, an interval of at least one year having elapsed between tests. The 
children as a whole were above average in parental occupation and in 
their own initial IQ's, which averaged close to J 10. 

A special precaution was to have the tests administered b>^ examiners 
who had no knowledge ol the child's previous test performance and, in 
the case of at least 8()‘'7 of the children over 6, no knowledge of their 
previous nursery school attendance. Moreover, the examiners were not 
connected with the nursery school. Thus their degree of mutual acquaint- 
ance ^^\\h the nursery group was no greater than with the non-nursery 
group. It is pointed out that such acquaintance might have had a two-way 
effect, through both the examiner’s and the child’s attitude, in laising the 
score of the preschool group. vSpccial ctlorts were also made to secure 
conditions of maximum motivation. Children were not tested on their 
first visit if they exhibited negativistic behavior. If a continued uncoojKTa- 
tive attitude still rendered testing unsatisfactory, the case was excluded 
from the study. 

The mean gains obtained in the retesting with the Minnesota Preschool 
Scale after one, tw'o, and three years of nursery school attendance are 
given in Table 9. The advantage in favor of the nursery school group, 
according to these data, is cither negligible or lacking. It was further 
shown that amount of nursery school attendance bore no relation to sub- 
sequent rise in IQ. The correlation between Stanford-Binct ]Q at age SVz 
and number of days of previous «atlcndancc at nursery school, with initial 
IQ constant, was .013. In subsequent follow-ups, the mean initial IQ on 
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the Minnesota Scale was compared with Stanford-Binet IQ at ages 5 Vi, 
6*/i, 8y2, I0y2, and \2Vi for both preschool and non-preschool groups. 
Although both groups tended to improve in the later tests, the improve- 
ment did not favor the nursery group. In fact, many of the comparisons 
show a significant advantage in favor of the non-nursery group. 


TABLK 9 IQ Cain in Relation to Length of Nursery School Attendance 


(.n.ita fiotn ^ loodciiottj.’^h and flaunt, 18. pp. H)0-J7|) 


Ci roup 

Onc-ycai Retest 

N Mean IQ Gam 

Two-year Retest 

N Mean IQ Gain 

Three-year Retest 
,V Mean JQ Gam 

Preschool 

Control 

84 4.6 

122 4.6 

5 1 6.2 

29 4.6 

13 5.8 

15 4.0 


In explanation of the greater gains of the non-nursery group, the authors 
point out that selective elimination operated more markedly in the non- 
nurscry sampling. Children dropped out of this group in greater numbers, 
and those who dropped out tended to be of lower IQ, than was the case 
in the nursery group. It was the children of the intellectually superior 
parents, in general, who tended to remain in the study in the non-nursery 
group. From this observation, the anlhois go on to suggest that, as children 
r^row older, they approach more closely their “true” intellectual level and 
therefore come to resemble their parents more closely, the implication 
being that this resemblance is primarily a matter of heredity . It should be 
noted that, logically, the reported facts aie equally consistent with an 
evplanation in terms of Wv' environmental effect of superior homes. vSince 
the non-nursery children who lemained in the study came from superior 
homes, development in such homes would stimulate a rise in the functions 
sampled by intelligence tests. The nu. -cry group, according to the authors’ 
own report, did not undergo so much selection in terms of home environ- 
ment, and would therefore be less likely to improve. Whatever gains 
resulted from pieschool attendance would thus have to counterbalance 
the greater gains resulting from home influences in the control group. 

The fact that motivational differences were ‘dimiiiated to a greater extent 
in the present study than in the T y/a investigations may also in part 
account for the divergent results of ih< two studies. As will be shown in a 
later section, such motivational differences may play a significant part in 
the observed effects of nursery school experience upon tested intelli- 
gence. 

From the Institute of Child Welfare of the University of California, 
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Jones and Jorgensen (23) report data on a total of 54 nursery school chil- 
dren. Comparisons were made with control groups which had had no 
nursery school training but had participated in a similar program of serial 
mental tests. Socio-economic level and parental education were superior 
in both nursery and control groups. Follow-up studies included annual 
retests between the ages of 5 and 9. The tests administered were the Cali- 
fornia First Year Mental Scale up to 18 months of age, the Ciilifornia 
Preschool Schedule up to 5 years, the 1916 Stanlord-Binet at ages 6 and 7, 
and the 1937 Stanford-Binet at ages 8 and 9. 

For purposes of analysis, the total nursery group was subdivided into a 
number of smaller groups. First, 14 nursery children were matched with 
14 non-nursery children in parental education. These two groups showed 
no significant difference in the “growth curves” of their test performance 
at any age. In another comparison, 1 1 nursery children were matched with 
1 1 non-nursery children in mental test scores prior to nursery school at- 
tendance, as well as in the educational and occupational ratings of their 
parents. These two groups showed an increasing dilTercntiation in test 
score with age, but no one of the dilTercnccs in favor of the nursery group 
is statistically significant at any age. In this group, the parental occupational 
level proved to be slightly higher for the nursery group, the matching 
having been only approximately achieved. The authors point out that the 
actual discrepancy between the homes of the two groups ma>^have been 
even greater than the occupational index indicated, since the parents who 
sent their children to nursery school at some financial sacrifice were prob- 
ably superior. Thus in this stud\% the slight superiority in the home envi- 
ronment of the /iin scry group is offered as a possible explanation of their 
slight advantage in intellectual development. 

A third group of 29 nursery children who had been given a number of 
different intelligence tests were compared with six control groups. Of the 
latter, two were matched with the nurseiy group on the basis of initial IQ, 
two on the basis of terminal IQ (i.c., IQ after nursery school attendance), 
and two on the basis of IQ’s at ages 8 and 9. No significant difference be- 
tween control and nursery groups was found in any of these comparisons, 
the various control groups differing more among each other than from the 
nursery group. For example, a group w^hich had the same IQ as the 
nursery group at age 9 did not differ significantly from the nursery group 
in the testing prior to nursery school attendance. Or, conversely, a group 
matched with the nursery group in initial JQ did not differ significantly 
from it on subsequent tests. 

Of particular interest is the analysis of length of nursery school attend- 
ance. In a group of 66 cases whose nursery school attendance varied from 
50-99 to 450-499 days, the change in IQ on the California Preschool 
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Scale correlated .34 with length of attendance. The authors point out, 
however, that longer attendance was associated with greater number of 
testings in this study. As was shown in the preceding chapter, retesting 
will usually in itself raise scores. In the present study, the correlation 
between number of tests and IQ change was also .34. As might be ex- 
pected from these data, the partial correlation between length of nursery 
school attendance and IQ change, when number of tests was held con- 
stant, proved to be only .05. 1 his finding was corroborated in two other 
groups Irom the California growth study, consisting of 68 and 87 children. 
The corresponding partial correlations in these two groups were —.06 
and .03, respectively. 

From this brief examination of typical results it would seem that 
liUrscry seliool attendance may have a slight elfect upon the test per- 
formance of most groups of children. The fact that so many of the 
groups studied come from superior home environments would tend to 
obscure the influence of the nursery school and may account for the 
lack of difference found in some investigations. Most of the nursery 
schools in which thc.se investigations were conducted arc connected 
with universities and tend, on the whole, to draw children from rela- 
tively superior families. In all but a very few studies the initial average 
10 of the children was 1 10 or higher. If the children arc already in 
an environment favorable for the development of those functions sam- 
pled by the intelligence tests, it is diflicult to bring about additional 
improvements in 10 by special influences. A few of the investigators 
have recognized this factor, and there is some evidence that the effect 
of nursery school attendance may be greater with children from lower 
socio-economic levels (cf. 35, 39). 

That the differences observed are often insignificant may result, too, 
from the very small groups cmploN. d in many of the studies. To find 
that a difference is insignificant under these conditions does not dis- 
prove the existence of a real dift'crcncc — it merely fails to prove it. Such 
a finding certainly does suggest, however, that if there is a real effect 
of preschool attendance upon intelligence test performance, it must 
be slight. It has also been quite conclusively showm that, whatever the 
effect, it is manifested early, and longer preschool attendance 
has little or no further influence upon the IQ. The interpretation of 
these findings will be postponed until the concluding section of the 
present chapter, since there are a number of methodological problems 
which must first be considered. 
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THE EFFECTS OF SCHOOLING FROM ELEMENTARY 
SCHOOL THROUGH COLLEGE 

A few studies have dealt with the effects of schooling at the elemen- 
tary, high school, or college level upon intelligence test performance. 
Some of these investigations report only retest results on a single 
group — a fact which precludes a clear-cut interpretation of their 
results. For example, retests of college students with the American 
Council Psychological Examination after 1 to 4 years of college work 
generally show a considerable rise in mean score (cf., c.g., 4, 19). 
These gains may be wholly or in part the result of simple repetition of 
the test or of retesting with a parallel form. They may also reflect in 
part the general improvement which the group would have made 
within a year even without attending college. What the net effect of the 
college experience was in producing the observed gains cannot be 
determined solely on the basis of the given results. 

In a study (53) on children enrolled in three superior private 
schools in New York City, the results arc also difflcult to interpret for 
somewhat different reasons. The Stanford-Binct records of approxi- 
mately 3000 child^'cn, accumulated over a period of 20 years, were 
examined. Among these records were over 1100 retests gi\\'n after 
an interval of 2 V 2 years or more, during which the child had attended 
the particular school. A significant mean gain in retest 10 was found 
in one of the three schools, but not in the other two. Further analysis 
of the scores from the former school showed that the maximum gain 
occurred within a short period of residence, later gains being neg- 
ligible. The investigator offers no conclusive explanation of these find- 
ings, but suggests the possibility that “subtle and unidentifiable selec- 
tive factors” may have operated in the one school to produce the 
gains. The slight mean gains found in the other two schools arc no 
greater than is usually found in the repetition of a test. 

In connection with the Jowa project cited in the preceding section, 
Wellman (57) compared a group of 269 children attending the uni- 
versity elementary school with a group of 47 children attending other 
schools. The two groups were equated in age and 10 at the end of the 
preschool period. The average age at this lime was 5V2 years, and the 
mean IQ’s were 120.5 and 121.0, for the university school and other 
school groups, respectively. The former group gained an average of 
5.6 IQ points after nearly 4 years' attendance at the university cle- 
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mentary school, while the other group gained only 1 .2 points during 
the same period.^ 

An interesting comparison was made in an investigation (66) on 
rural children enrolled in the first three grades of consolidated and 
one-room schools in the same rural area. Stanford-Binel IQ’s obtained 
in the fall and spring for two years showed significant gains during the 
school sessions on the part of the consolidated school children, but 
onl> a slight change or a loss in the one-room schools. This dilTcrcnce 
was not related to family background or to home environment, but 
is attributed by the author to the superior educational facilities 
alTorded by the consolidated schools. 

Another approach is illustrated by studies on the relationship be- 
tween amount oj education and intelligence test score. I'hat a consid- 
erable relationship exists has long been a familiar fact. During World 
War r, when intelligence testing was still in its infancy, a correspond- 
ence between amount of schooling and intelligence test score was 
clearly demonstrated. Thus in a sampling of 48,102 recruits, the 
correlation between Alpha score and extent of schooling was .74 
(cf, 67). A similar relationship between AGCT score and extent 
of education was found in World War II. The establishment of 
such a relationship, however, does not in itself enable us to choose 
between the two alternative explanations, viz., ( 1 ) education raises 
the intellectual level, or (2) the brighter individuals are more likely 
to ‘"survive” the increasingly stringent selection of the successive 
educational levels. That the duration of any one individuafs education 
is not entirely dependent upon his ability is fairly obvious. Financial 
resources, family tradition a- J altitude!-, educational facilities in dif- 
ferent localities, and a number of other non-intellectual factors can 
readily be cited. 

An intert'sting cfl'ort to secure data bearing more directly upon this 
question is reported by Lorge (33). In 1921-22, 863 boys, consti- 
tuting a representative sampling of the public school population of 
New York City, were tested in the eighth grade with a number of 
psychological tests. Included among these tests were a reading and an 

^I'hc gain of ttu univeisity school chilJrt. -ms a critical ratio of 2.S4 and is 
thus moileraiely signiliciint, although lulling short of the coiv.entional criterion of 3.0. 
The control group gain is qinle insienrficani, havjtig a critical latio of .41. 

^Cf. 7 for the mean AGCT scores of men with different amounts of schooling. 
In a sampling of 4330 men, a correlation of .73 ‘vas found between AGCT score and 
highest grade completed jn school (cf. 47, p. '65). 
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arithmetic test which together yield a composite score reported to be 
essentially equivalent to the scores on most intelligence tests. Twenty 
years later, 131 of the original subjects, shown to be a representative 
sample of the total group in terms of original test means and SD’s, 
were given two group intelligence tests. Typical results on one of these 
tests arc reproduced in Table 10. 


TABLE 10 A Twenty-Year Follow-Up on the Effects of Schooling upon 
Intelligence Test Performance 


(From I^oTv’i, 33, ji. 487) 


Imiial hitelli^ence 

/ CM Scofc in 1921 ■ 49- 58 


Ih^hcM School 

Otis 


Grade ( ompleled 

Score 

N 

1 

8 

14.0 

4 

9 1 

19.0 

1 

10 i 

24 0 

1 

11-12 

21,0 

1 

13-14 



15-16 



17 or more 



Initial Intelligence 



'1 CM Si otc in 1921 ' 

79 88 


Highest School 

Otis 


Grade C (nnplctcd 

Scoic 

N 

8 

26 4 

5 

9 

3i 1 

8 

10 

28.5 

8 

11-12 

31.0 

9 

13-14 

34.7 

4 

15-16 

39.5 

6 

17 or more 

46 0 

5 


59 68 

69 78 

Otis 

Score N 

Otis 

Scoic 14 

22.0 4 

19.5 2 

22.0 4 

26.0 1 

22 0 1 

34.0 1 

1 

20.7 9 

14 5 2 

25.1 9 

3 1 .7 3 

26.0 3 

27.0 1 

38.0 3 

1 

i 89 98 

99 114 

Otis 

Scoic N 

Otis 

Stnie N 

39.0 2 

33.0 1 

38.0 2 

29.0 1 

37.0 4 

46.5 2 1 

41.0 3 

34.0 1 

41.7 4 

37.5 2 

53 5 2 

.50 8 5 

54.5 6 

43 0 1 


Tt will be noted that individuals who fell within a single class-interval 
in the original test, but who completed varying amounts of education 
in the interim, differ considerably in their performance on the 20-year 
retest. The mean Otis scores within each initial category show a fairly 
consistent rise as education rises. For example, among the 30 men 
whose 1921 scores fell between 69 and 78 (third column in 1 able 10), 
the 9 who completed only the eighth grade obtained a mean Otis 
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score of only 20.7 in 1941. The 3 who had taken graduate training 
beyond college averaged 38.0. 

If we consider the last four columns in Table 10, in each of which 
a full complement oi cducatamal levels is represented, and if we 
combine extreme groups in order to deal with somewhat larger sam- 
ples, we obtain the summary data shown in Table 11, According to 


TABLE 11 A Comparison of lixticnic Groups fto/u Table 10 


Iniiinl IntcUiytcncc 

7 cst Sc enc in 02 1 • 

6‘) 78 

79 88 

89 98 

99 J14 

Ih^hesi S, hoot 

Otw 

Oils 

Oh's 

Otis 

(hculc Cinnph'tcil 

Sn>rc N 

Scoie N 

N 

Score N 

8 10 

22. 1 20 

0 21 

37.8 8 

38.8 4 

1 5 or more 

3S.3 4 

42.5 1 1 

.S4.3 8 

49.5 6 


these figures, a diUcrence of about 7 or 8 >ears in schooling led to 
a mean difTercnce of from 10.7 to 16.5 points in intelligence test 
score. As a means of gauging the magnitude of this dilfcrcnce, we 
may compare the groups whicli received the same amount of educa- 
tion. but dilfered in initial intelligence test scores, i.c.. reading across 
fable 10. For example, among the subjects with only eighth grade 
education, the mean Otis scores obtained by groups which differed 
in their initial test scores range from 14 to 39. Among those with 
11 or 12 years of schooling, the means range liom 21 to 41. Within 
the highest educational level, with one or more years of academic 
work beyond college, the r. ’gc ‘S from 38 to 54.5. Thus the differ- 
ences between columns in d'able 10 appear to be about as large as 
those between rows. In other words the inter-group differences in 
adult scores were approximately as -arge when education varied as 
wdicn initial score varied 

In evaluating such a finding, a number of points should be borne in 
mind. On the one hand, the subjects upon whom these comparisons 
arc based were not exactly equated in iniiia! performance (cf. 14). 
Fach of the vertical categories in T ables 10 and I 1 covers a class- 
interval of 10 points in initial score. Fi om this fact, it might be argued 
that those boys falling near the top of this initial range in any one 
category were the very ones who continued their education longer 
and received higher scores 20 years later. The concTusiveness of the 
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demonstrated effect of schooling is further limited by the small num- 
ber of cases involved in some of the comparisons and by the fact 
that both initial and final tests inevitably fell short of perfect reli- 
ability. The chance errors thus introduced undoubtedly affect the 
amount — and possibly in some cases even the direction — of the 
obtained differences. 

These considerations would suggest that the apparent effect of 
schooling in the above study may be slightly overestimated. On the 
other hand, it should be noted that all the subjects in this study had 
the first eight years of schooling in common. They had, in fact, 
attended the same classes during their first eight years. This “con- 
stant,” added to each subject’s education, would certainly make the 
individuals more nearly alike than they would have been had some 
of them received less than eight years of education. If the range of 
education within the group had been wider — say, from the third grade 
to college graduation — there is no doubt that the net effect of school- 
ing on intelligence test score would have been greater. In fact, it is 
not unlikely that on most current intelligence tests the effect of the 
first eight years of schooling is greater than that of subsequent educa- 
tion. Beyond elementary school, the content of instruction is less 
standardized and uniform, and therefore less likely to bc^ sampled 
in the construction of intelligence tests. Thus the present study dem- 
onstrates that extent of education can influence intelligence test per- 
formance. But it would be un'warranted to generalize regarding the 
extent of such effect beyond the specific conditions of this study. 

Relevant data arc also to be found in a comparison of the intel- 
ligence level of soldiers in World Wars I and JI (55). A group of 
768 enlisted men, representative of the entire population of while 
enlisted soldiers in World War II, were given both the ACiCT and 
a revision of the Army Alpha. The distribulion of this group on the 
AGCT paralleled very closely that of the entire army. On the Alpha 
their median score was 104, in comrast to a median of 62 obtained 
in World War 1. The magniiudc of this difference can be more clearly 
envisaged when we consider that the median of the World War 11 
sampling corresponds to the 83rd percentile of World War I. In 
other words, 83% of the World War I group fell below the median 
score of the World War II sample. A number of factors may help 
to account for this marked improvement in intelligence level over 
the twenty-five years. Among them may be mentioned the later 
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group’s greater experience in taking tests in school, in industry, and 
in the army itself. The possible influence of better physical condition, 
as a result of improvements in public health and in nutrition, should 
also be considered. The major factor, however, appears to be the 
higher educational level of the population, together with probable 
improvements in the quality of instruction, length of school term, 
and the like. In the World War IF sample, the average education was 
10.0 years, i.e., two years of high school. The comparable World 
War 1 average was 8.0, or elementary school gracluatiem. 


METHODOLOGICAL PROBLEMS 

The methodological problems characteristic of studies on the effects 
of schooling arise largely from two necessary conditions of such in- 
vestigations, viz., lon(?iludinal observation and the comparison of 
matched groups. We shall consider some of the most persistent of 
these problems under seven major headings. Some are concerned 
with the choice of subjects, others with the measuring instruments 
or the particular conditions to be controlled in the course of the 
observations. A few deal with broader questions of the general plan 
or experimental design of this type of investigation. All these points 
have been cited and elaborated in the course of the controversy 
regarding the effect of schooling on the The present section is 
not, however, intended as a summary of the criticisms which the 
various workers in this field have directed against each other's re- 
search — the list would need to be much larger in that case! Our 
present concern is with the more general methodological problems 
which every investigator in this field must face, rather than with the 
minutiae of the shortcomings of specific studies. 

Sampling Problems. Considerations of sampling, cr the choice of 
subjects to be investigated, enter into “schooling” studies in several 
ways. First, because this type of investigation is generally based on 
longitudinal, “follow-up” observations, it is likely to be automatically 
restricted to a selected sample of the general population. Stability 
of residence and continued cooperate . a of parents would, for ex- 
ample, be necessary conditions for the inclusion of children in a 
follow-up study of several years’ duration. A group which is “selected” 


^^vSee especially 15, 16 22. 36, 37, 38, 51, 60, 63. 
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in terms of these conditions may in turn show other characteristics 
related to cultural level of the home, parent-child relationships, and 
the like. For these reasons, it is likely that the samplings employed 
in longitudinal studies tend to be somewhat superior to the general 
population. The reverse may be true in the case of institutional sam- 
ples, such as orphanage children. In this situation, the superior 
members may, for example, be more often removed for adoption. 
The enduring sample would thus represent an inferior selection. In 
either case, generalizations from a longitudinal sampling to the total 
population must be made with considerable caution and with due 
regard for the selective factors which may have operated in the 
particular situation. 

A second source of sampling difficulty pertains to the matching 
of experimental and control groups. Ideally, matched groups should 
be set up in advance by the experimenter, from the same population. 
In testing the effects of nursery school attendance, for instance, the 
experimenter would pair of! children in advance on the basis of 
matching characteristics, and would then assign one member of each 
pair to the nursery group and the other member to the control group. 
The choice within each pair would be purely random. 

In actual practice, investigations of schooling ha\e had to resort 
to a posteriori matching. Certain children within a community are 
entered in nursery school on the basis of their parents’ decision. Such 
a decision may itself reflect characteristics which distinguish these 
parents, their homes, or their children from others in the community. 
The investigator now' steps in and tries to find other children in the 
community who “match"' these nursery children in w'hat he considers 
to be important characteristics for his study. The difficulty lies in the 
possibility that one or more characteristics whose relevance to the 
problem at hand may have been overlooked will now be allowed to 
vary betw^een the two groups. If the assignment to nursery and non- 
nursery groups had been made in advance by the experimenter, these 
uncontrolled characteristics would probably vary at random in the 
two groups and no serious error would have been introduced in the 
results. But if special factors, such as parents’ decision to register 
their child in nursery school, determine the placement of the child 
in experimental or control group, then the uncontrolled character- 
istics may vary systematically, piling up an excess of one type of child 
in only one of the groups. 
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If, for example, children from the more “progressive” or “enlight- 
ened” homes are sent to nursery schools, then the systematic dif- 
ference in home atmosphere in favor of the nursery group might 
in time lead to superior development of this group, in contrast to the 
control group. Or it might happen that children who are inclined to 
be shy are more often sent to nursery school to enable them to over- 
come this difficulty. In such a case, the child’s shyness might handicap 
him on his initial intelligence test and lead to an apparent gain on 
a later test, when the shyness in the unfamiliar situation had de- 
creased. T hese examples are given merely to point up the dangers 
of a posteriori matching. No investigator can foresee or even subse- 
quently identify all relevant characteristics in which his control and 
operimcnlal groups should be equated. The random assignment of 
individuals to the two groups in advance is therefore an important 
safeguard against systematic differences in unmatched characteristics 
between control and experimental groups. 

In any comparison between matched groups, it is of course essen- 
tial that the groups be equivalent at the time when the comparison is 
made. Groups which were originally matched closely may become 
quite unlike through the selective dropping out of individuals, a selec- 
tion which may operate differently in experimental and control groups. 
Similarly, it would obviously be misleading to compare the average 
initial 10 of 100 children with the average 10 of 13 of these children 
who have remained in the study four years later. The only significant 
comparison in such a case would be that between the initial scores of 
these 1 3 children and their own final scores. 

Finally, the size of sampiing needed in this type of investigation 
should be considered. Many of the studies have been conducted on 
very small samples. Even when scvi ^al hundred subjects are included 
in the investigation, specific crucud comparisons have often been 
made between sub-groups of less than 50 cases. If the effects of 
schooling on intelligence test performance were very large and clear- 
cut, small samples would sufiicc to demonstrate the relationships 
under consideration. But the effects of scha'oling constitute a small 
part of all the influences which ma’^‘. for similarities and differences 
among individuals or groups. When effects arc relatively slight, they 
may readily be obscured by chance factors in a small sample and 
only insignificant differences will be obtained. 

Statistical Regression, As a general statistical concept pertaining 
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to correlated measures, regression has long been familiar. Its par- 
ticular application to the type of investigation under consideration 
(cf. 36, 52) is based upon: (a) certain sampling problems arising 
from the use of matched groups, which were discussed in the pre- 
ceding section, and (b) the fact that test reliability is always short of 
perfect, i.e., every test score contains some “error of measurement.” 
Upon retesting, regression may artifically produce two distinct effects, 
one pertaining to the relative position of individuals within the group, 
and the other to the relative standing of the two matched groups. We 
shall begin by considering the first of these two effects, since it is 
the simpler of the two. 

Statistical regression simply means that extreme scores on an im- 
perfect measure of any characteristic tend to “regress” or move toward 
the mean upon retesting. Such an effect occurs when two different 
tests of the same characteristic arc given, as wxll as upon the repeti- 
tion of a single test. For example, if a test for speed of tapping is 
administered to 100 subjects on Monday and again on Wednesday 
of the same week, a tendency will be found for those who scored 
far above the group average on Monday to fall closer to the average 
on Wednesday, and for the Monday low scorers to rise toward the 
average on Wednesday. Similarly, if a group of children are tested with 
the Stanford-Binet and the Mcrrill-Palmer Scales, those receiving 
high Binct IQ’s will tend to drop on the Merrill-Palmer and those with 
low Binet IQ’s will, in genera’, show a gain on the Merrill-Palmer.’^’^ 
It should be noted that the regression effect does not depend upon 
the sequence in which the tests are administered. It simply occurs 
in any comparison between the scores of the same group of indi- 
viduals on tw'o imperfectly correlated measures. For example, if we 
select the children with the highest Merrill-Palmer IQ’s in the above 
illustration, we shall find on the whole that their llinct IQ’s are not 
as high as their Merrill-Palmer IQ’s; those with low Merrill-Palmer 
IQ’s will tend to do better on the Binet than on the Merrill-Palmer. 
Similarly, in the tapping illustration, the Wednesday high scorers will 
tend to have performed more poorly on Monday, and the Wednes- 

This example is taken fr('»m actual results obtained in one of the Towa nursery 
school studies (cf. 59, p. 98), in which IQs on the Stanford-Binet (or Kuhlmann- 
Binct lor younger childicn) were compared with Merrill-Palmer IQ’s. The regression 
effect refers, of coiiisc, to the individual’s relative position in the group, not to 
absolute differences resulting from test standardization, practice, and th like. 
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day low scorers will have done better on Monday. Thus in any 
comparison between two measures which are not perfectly correlated, 
regression occurs in both directions^ i.e., it is a reversible effect. 

Such a regression effect results entirely from the '‘error of meas- 
urement” in the scores, I’hus, some individuals who received high 
scores on the first test did so in part because an error of measurement 
raised their score on that particular occasion. Since such errors of 
measurement are uncorrclated on two testings, this person will prob- 
ably score lower upon retesting, i.e,, by chance the error will be 
unlikely to occur in the same direction on both occasions. By the 
same token, some of the individuals receiving low initial scores did 
so because chance factors lowered their score on that particular 
occasion. To the extent that this was true, these individuals will tend 
to gain on a retest. 

It should be noted parenthetically that the error of measurement 
to which we refer need not be an “error” in the popular sense. It is 
an error only in so far as the two tests are attempts to measure the 
same behavior and would therefore be expected to yield identical 
scores. Any factor specific to one of the tests and not entering into 
the other would tend to make tlie two scores unlike and would consti- 
tute an “error” for the pre^^ent purpose. In the above illustrations, such 
an error would rcllccl the different fortuitous inffucnces which might 
raise or lower performance on any day on the ta[)ping test. It would 
also include the factors specific to such tests as the Stanford-Binct 
and the Mcrriil-Pa liner — factors which differ from test to lest despite 
the fact that both tests arc designed to measure intelligence. In its 
broader applications, then, regression occurs belW'Cen any tw^o meas- 
ures whose correlation is less than 1 .00 and which therefore include 
specific factors diff ering from one I ’Casure to the other. 

Not all changes in score, of course, are the result )f regression. 
With a highly reliable measuring instrument, the error of measure- 
ment will be slight and the scores will reflect more accurately indi- 
vidual differences in the ability being measured. It follows under 
these conditions that changes in score from lest to retest will depend 
more largely upon actual gains and <>sscs in the ability under con- 
sideration. With less reliable measures, however, the error of measure- 
ment constitutes a relative!) large part of the score, and regression 
effects will be greater, Jn the te'^ting of children of preschool ages, 



244 Differential Psychology 


the reliability of intelligence tests is relatively low.^^ Pqj- this reason, 
the regression effect becomes an important problem in the evaluation 
of data obtained on preschool groups. 

An observation repeatedly reported in a number of nursery sc]u)pI 
studies, both with superior nursery groups and with orphanage cliil- 
dren, is that the brighter individuals in the group lend to gain least 
or even to lose following nursery school attendance, while the duller 
members make tlic largest gains. This has been interpreted by some 
as a ‘leveling” effect of the nursery scliool expedience. It has bee!i 
argued that the “stimulating value” of each specific environment 
tends to make individuals approximate a particular intelligence level. 
Individuals above this critical point in their initial IQ will gain noth- 
ing and may even be “pulled down” to the performance level cor- 
responding to the environment in which they have been placed; tho e 
below this critical level, on the other hand. W'ill be raised by the 
environmental stimulation. 

The correctness of this explanation can always be checked by 
comparing the toKil i^roup variability before and after the interpolated 
experience (cf, 3b). If, for example, nursery school attendance really 
has a leveling influence upon the IQ, then the range of iiulividir.i! 
IQ's should decrease from the initial to the final test. Such a decrease 
in variability should likewise he discernible in a drop in the SD of 
the group. In the nursery school studies it has been demonstrated, 
however, that uuli vicinal dificrenccs do not decicase siipiificantlv diiriii:^ 
preschocA attendance. 1 he number of cases at dillcrent IQ levels 
tends to be l!ie same before and after such an experience, although 
different individuals fall into each IQ level on the initial and final tests. 
Thus what is actually occurring is that individuals are merely trading 
places upon the retest, rather than undergoing a leveling of ability. 

The distinction between Ic /cling and regression is illustrated schemati- 
cally m F igLire 4n. f or simplicity of di'>cussion wc shall a-'Sume that both 
distrihution> ha\'e identical means of lOO as well as identical vai iahihties. 
Part A of this figure show^ the clfeet of regression upon 10 hynolhcticul 
individuals selected because each received an IQ of exactly 120 on Tesi 1. 
It will be noted that, owing to “chance errors” and specific factors in the 
scores on Tests I and 2, these 10 persons “fan out” on Test 2. The average 
of the 10 however, is nearer to the distribution mean on Test 2 than 

^-Thc rciisoris fi)i such low reliability w.ll be d s' issed in a subsequent section 
on ihc instability of caily IQ’s. 
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A Regression Effect 

!Q on 
Test 1 


IQ on 
Test 2 




Fig. 46. Contrast befx^ een Regression and Leveling Effects, 
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it was on Test 1. Thus the mean of these 10 cases was 120 on Test 1 , but 
is only 107 on Test 2. It is only in this sense that the 10 individuals have 
regressed toward the distribution mean. At the same time, it should be 
noted that certain individuals who score 120 on Test I actually fall farther 
from the distribution mean on Test 2 than on Test 1. In the diagram, this 
is true of persons A, B, and C. The reader should visualize a similar 
“fanning out” of scores throughout the distribution of Test I . This means, 
first, that individuals receiving any one score on Test 1 are likely to spread 
over several scores on Test 2. Secondly, the average of these \ est 2 IQ’s 
will not fall as far from the Test 2 mean as the Tes‘ 1 IQ of these same 
individuals diverged from the Test 1 mean. 


10 on IQ on 

Test 1 Test 2 



Leveling cfTect is illustrated in Part B of Figure 46. Ten hypothetical 
persons having ditTercnt IQ’s on Test 1 arc shown to have moved closer 
to the mean on l est 2. It only leveling operates, this movement toward 
the mean would occur in each individual. Moreover, if we assume for 
clarity of illustration that no regression occurs at all in this comparison, 
but only leveling, then 10 persons wath identical IQ’s of 120 on Test I 
will also have identical IQ’s on Test 2, although they will be closer to the 
mean on the latter test. In the illustration given in Figure 46, the Test 2 
IQ’s of these 10 persons are all 107. It is apparent that, in true leveling, the 
variability of the group should drop from Test I to Test 2. Such a drop 
does not occur when only regression operates. 

An illustration of the “reversibility” of the concept of statistical re- 
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gression is given in Figure 47. Beginning with 10 hypothetical individuals 
whose IQ’s are all exactly at the mean of Test 1, we see that on Test 2 
their IQ’s spread over a considerable range. If, on the other hand, we 
begin with 10 individuals whose JQ’s fall at the mean of Test 2, the iden- 
tical effect is found, in the reverse direction, on Test 1. Since, unlike 
leveling, the regression effect occurs equally in both directions, it is obvi- 
ous that variability cannot decrease.^ * The tact that individuals only trade 
places in the regression effect, without affecting the range of scores, may 
be vividly illustrated if we observe the positions of individuals A, B, C, 
and D in Figure 47. ll will be noted that A and B move from the distri- 
bution mean on Test 1 to a high and a low IQ, respectively, on Test 2. In- 
dividuals C and D, on the other hand, who had a high and a low IQ, respec- 
tively, on 1 est 1 , fall at the distribution mean on Test 2, Whenever any one 
individual shifts toward the mean from Test I to Test 2, he obviously shifts 
away from the mean by an equal amount from Test 2 to Test 1 , 

So far, wc have discussed regression and leveling effects only with 
reference to individuals. Under certain conditions, however, regres- 
sion may also produce a difference between the means of two ini- 
tially matched groups. This will occur if the populations from which 
the samplings arc drawn differ in the characteristic in terms of which 
the groups arc matched. Regression of scores on a second test occurs 
toward the mean of •the population from which the cases are selected. 
If these means differ appreciably, then two samples of these popula- 
tions, which were deliberately matched in initial score, will diverge 
on a retest in the direction of their respective population means. 

For example, men and women as a whole will differ in performance 
upon a test of strength of grip.^* The male populauon will have an 
appreciably higher mean than the female population in this chcirac- 
tcrislic. If, now, we select a sample of men and women who are 
matched on a test of strength of grip, we will have to choose men 
from the lower end of the total male distribution in this characteristic, 
and women from the upper portion of the female distribution. A 
retest of strength of grip administered to these matched samples will 

In prcclictlnf: scores from a regression quation, the variability docs decrease. 
Thus if inc measiiied and the predicted scoics v- i relate .75, the SD of the picdicted 
standard scoics will be only thicc-fouiths as laigc as that of the actual standaid 
SCOICS. In the present situatiim, however, we aie dealing throughout with actual 
scoies. 

’•‘This illustration, as well as the! hypothetical data which follow, are taken from 
R. T. Thorndike (52, p. 91). 
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show that the women have regressed downward toward their popula- 
tion mean, and the men have regressed upward toward theirs. This 
assumes, of course, that each strength-of-grip test falls short of per- 
fect reliability. 

As long as the category in terms of which the population Is defined 
(in this case, sex) is itself related to the characteristic in which the 
subjects are matched (i.e., strength of grip), then the means on a 
retest will regress toward the respective population means. This fol- 
lows from the fact that whenever we choose individuals from the 
upper end of the total distribution, we are capitalizing on errors of 
measurement, i.e., on whatever component in the scores of Test 1 
is uncorrelated with the scores of Test 2. We tend to choose indi- 
viduals who have scored high not only in the components shared by 
Tests 1 and 2, but also in those specific to Test 1. The scores of the 
majority of such individuals are thus likely to drop on Test 2. The 
reverse will be true if wc choose a sampling of individuals from the 
lower end of the Test 1 distribution. 

A clear illustration of this regression effect is furnished by R. L. 
Thorndike (52, p. 91 ), using artificial data derived from dice throws. 
Such data have the advantage of demonstrating the regression effect 
which follows as a mathematical necessity from the given conditions, 
without the confusing interference of other unknown variables which 
might operate with real subjects. The relevant data for this illustra- 
tion will be found in Table 12. Scores corresponding to initial test 
and retest were found for 132 “men'’ and 132 “women,” constituting 
the two populations,^'* The mean difference between these two popu- 
lations on the initial test proved to be 4.7. From tliese two distribu- 
tions, matched samples of 64 men and 64 women were selected. 
The means of these two jnatched samples on the initial lest were, of 
course, identical, each being 20.3. When the retest scores of these 64 
men and 64 women were examined, however, the means of the two 
groups were 21.0 and 19.2, respectively. Thus the two samples which 
had been matched on the initial test regressed toward their respec- 
tive population means on the retest. 


For each individual, 7 dice were Ihiowii. The score on the initial test was the 
numbei of spots showing on dice 1, 2, 3, 4, and 5, and the score on the retest the 
number of spots showing on dice I, 2, 3, 6, and 7. In determining the “men’s’ scores, 
the same dice-throwing procedures were followed, and a constant value of 5 was then 
added to each score. 
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TABLE 12 The Effect of Regression upon Matched Groups 


(Data from R. L. Thorndike. 52, pp. 90) 



Population: 
Initial Ti'M Means 

Matched Samples: 
Initial Test Means 

Matched Samples: 
Retest Means 

Men 

111 

20.3 

21.0 

Women 

17.5 

20.3 

19.2 

Difference 

4.7 

0.0 

1.8 


The similarity between the pattern of this hypothetical situation 
and that represented by the nursery school studies is cletirly apparent. 
Jf preschool children as a whole (i.e., the preschool population) tend 
to come from superior homes and to have higher IQ’s than non- 
preschool children (for a variety of reasons unrelated to nursery 
schoi)! attendance), then upon retesting, the matched preschool group 
will tend to gain in 10 and the non-preschool to drop or remain un- 
changed. Each will thus have regressed toward its own population 
mean. In such a case, the matching of the two groups in initial IQ 
must have been accomplished by the inclusion of children from the 
lower end of the preschool population and the upper end of the non- 
preschool population, as in tiie hypothetical male and female popula- 
tions of the above illustration. The extent of this regression cITcct 
depends upon the amount of dilTcrcnce in initial test between the two 
populations, as well as upon the test-rctcsl reliability. 'Ihc relatively 
low reliability of intelligence test scores for young children, coupled 
with the practice of matching samplings a posterior!, w'ould thus make 
regression a serious problem in the pre.school studies. 

The Role of the Examiner. In . ny retest study, the "personal 
equation” of individual examiners niust be taken into account. In 
longitudinal studies with the Stanford-Binet on school children, for 
example, the IQ’s obtained by dilTcrcnt examiners on the same group 
of children have been found to vary considerably (cf., e.g., 10). Some 
examiners give consistently higher and some consistently lower IQ’s. 
Unlike group tests, most individual - ales permit sufficient latitude 
in both administration and scoring to make the lolc of the particular 
examiner a considerable one. In the testing of young children, the 
part played by the individual examiner is even more prominent. 
Whenever long-range studies necessitate shifts in personnel, it is 
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therefore cssenliaJ to check for any systematic variations in the results 
of different examiners. A gciin or loss of a few points from initial to 
final 10 might otherwise be attributed to an actual change in per- 
formance, when in reality it resulted from the use of different exam- 
iners in the administration of the two tests. A lack of difference 
between initial and final 10 might likewise be a spurious result, if a 
more lenient initial examiner happened to be balanced against a real 
gain in the subsequent test. 

An even more important precaution is to rule out any possible 
effects of the examiner's ^'mental set” 1 his can be successfully accom- 
plished onl\ when the examiners arc ignorant of the child’s previous 
test records and of his experimental classification. The examiner 
should not, for example, know when he is testing a child in the 
nursery group and when he is testing one in the control group. With- 
out such a safeguard it is very difficult to prevent unintentional bias 
from operating in cither the administration or the scoring of Ihc tests. 
This is, of course, the type of precaution regularly followed in any 
well-conducted laboratory study. But it has rarely if ever been thor- 
oughly applied in studies on the effects of schooling, especially at 
the lower age levels. 

An ‘‘expectation” that a particular child will do well or poorly 
may be established through the examiner's knowledge of the child's 
previous performance, or through the examiner's personal hypothesis 
regarding the probable outcome of the experiment. The halo effect in 
any situation calling for ratings or judgment is a familiar example of 
the influence of such expectations. But its operation in more objective 
testing situations has also been demonstrated. Goodenough (16), for 
example, cites a study on the errors made by school teachers in grad- 
ing spelling papers. A correlation of .40 was found between the 
number and direction of such errors and the ratings for “personal 
attractiveness” given by the teachers to the same children. Children 
who were rated as more attractive by a particular teacher thus tended 
to gain by the clerical errors made in the scoring of their papers; those 
rated as less attractive tended to lose. These errors occurred under 
conditions in which the teachers were endeavoring to avoid making 
any errors! 

Emotional and Motivational Changc.s. Part of any demonstrated 
improvement in lest performance following nursery school attendance 



Schooling and Intelligence 251 


may result from better emotional and motivational adjustment of the 
child to the testing situation. In such a case it would be necessary 
to consider the extent to which such improved attitudes toward adult- 
determined tasks might cxwt a general effect upon the child’s learn- 
ing and intellectual development. 

That frustration and other emotional experiences can significantly 
affect the level of performance on the Stanford-Binct or other similar 
tasks Is suggested by evidence on both preschool and school children 
(30, 31 ). Especially important in this connection is the characteristic 
resistance or negativism often encountered in young children. Such 
behavior serves as a constant error, always lowering and never raising 
the score. The extent to which this enor can influence test scores was 
demonstrated in a study on approximately 100 three-year-old chil- 
dren (44). Those tests which the child had refused to perform — but 
not those that he h<id failed — were repeated on successive days until 
they were definitely passed or failed. The effect of this procedure 
was to raise the Kuhimann-Binet 10 in over two-thirds of the cases. 
Among these, 18 children gained from 15 to 24 points, and 7 gained 
as much as 25 to 35. There is a strong probability that nursery school 
attendance nia'' reduce negativism, especially in a psychological test 
which has much in common with the nursery school situation. 

Comparability of Measuring Instruments. The use of different 
inrclligence scales, which may not be comparable, at different age 
levels is obviously a complicating factor in any longitudinal study. To 
bo sure, such a procedure does not in itself affect the comparison of 
experimental and control giOups, since both groups take the same 
test at the same time. It should be noted, however, that groups 
matched on one test cannot be assumed to be matched for other tests 
which may sample somevdiat different behavior functions. Hence it is 
obvious that the use of different intelligence scales for initial and 
final testing would make the results confusing and difficult to interpret. 

The use of different scales hy different investigators may also 
account in part for the apparent inconsistenc) of their results. It has 
been pointed out. ^or example^ that c'^iain tests, such us nic Ivlerrill- 
Palmer Scale, are very similar m v^ontent to typical nursery school 

Similar results were obtained with the Mcrrill-Palmer Scale, although the gains 
were considerably reduced wiru a correction for refusals had been made in the 
original IQ’s. Even these coriected scores, however, were raised from l lo 14 IQ 
points in ovci one-fouith of the j ases, upon completion of the “refusal” tests. 
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activities. Improvement on such tests following nursery school attend- 
ance is thus likely to be highly specific to the test functions and not 
diagnostic of other behavior. 

What is perhaps less obvious — although equally important — is 
that the same, scale may not yield strictly comparable results at dif- 
jerent ages. Many tests, including the Stanford-Binet, measure rather 
different functions in the early and later age levels. At the lower 
ages, such tests cover largely sensori-motor coordination, sensory dis- 
crimination, memory for objects, and similar siniplc behavior. As age 
level rises, increasing emphasis is placed upon verbal and other sym- 
bolic functions. In a study of many years’ duration, misleading dif- 
ferences between control and experimental groups might appear sim- 
ply because the groups have been equated closely in non-verbal but 
not in verbal functions. I’hus groups which (for any reasem) arc 
more adept in verbal and other symbolic functions than they arc in 
sensori-motor activities are likely to show a gain in 10 upon later 
retesting, regardless of any specially interpolated inlluences. To be 
sure, such a gain would also occur in the control group, provided 
that matching is adequate and extends to all relevant factors, such a.s 
home background. But if the subjects arc matched only in initial 10, 
as is often tlic case, a difference might appear between them in later 
tests, which has no relation to the interposed experiences. This dif- 
ficulty cannot be completely avoided if testing begins at very early 
ages, wlien the repertory of verbal behavior is still largely un- 
developed. 

A further dihiculty in the use of certain scales, such as the Merrill- 
Palmer, in longitudinal studies is that the meaning of a score varies 
at dilferent age levels, owing to the statistical cliaracteristics of the 
norms. For example, an 10 of 1 14 at one age may indicate the same 
degree of superiority as an JO of 141 at a later agc.^^ Even in such 
carefully conslruclcd tests as the 1937 Stanford-Binet, spurious 10 
changes may occur at different age levels (cf. 17). The extent of 
individual differences in 10 on this scale is not constant at all ages. 
Studies by several independent investigators have confirmed the obser- 
vation originally made by the authors of the test that 10 variability 
is highest at age IVi or 3, lowest at about age 6, and again reaches a 

This tollou's from the fact that the SD of the mental ages on this test Joes not 
increase with chronological age in such a way as to yield constant IQ’s at successive 
ages. Such an increase in vanabilily is a statistical picrccjuisitc for the ohiaining of 
IQ\s winch are comparable in meaning at dilTercnl ages 
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high peak at age 122^ These differences in variability are large enough 
to produce age changes of 15 to 20 points in the IQ of children who 
are far above or far below the group mean (2 to 3 SB’s away from 
the mean). Changes of 8 to 12 points in the mean IQ of groups com- 
posed largely of superior or of retarded children can likewise be 
brought about by these conditions. 

When the subjects in the control and experimental groups arc indi- 
viduahy matched in age, such shifts In the meaning of an IQ do not 
constitute a serious diOiciilty, although they certainly introduce con- 
fusion and awkwardness into the interpretation of results. If, on the 
othci Hand, the groups are only roughly equated in age, then serious 
enor may result in the comparative evaluation ol 10 changes in the 
experir.eijtal and control groups Moreover, any comparison between 
samplings fiom dillerent nopulations, varying in home background, 
parenial intelligence, etc , might be completely vitiated W these 
characteristics ol the measuring instrument. Any gap in perlorm- 
ance level initially existing between such groups may be spuriously 
enlarged for reduced) at different age levels. 

Care should also be taken to inmre '^hat the groups to be com- 
pared are given an equal nnmhei of iniclliitencc tests. Otherwise it 
may olten happen that children in lurscry school, or those attending 
a superior school, receive more practice either on the same test 
employed in the experiment or on closely similar tests. It will be 
recalled that scores on the second or third administration of a test 
are not directly comparable to scores on its first administration. 

Instability of Early IQ’s. That IQ’^ obtained in infancy and early 
childhood are relatively unstable is not surprising in the light of much 
of the preceding discussion. In a foil w\-up of 91 children of superior 
socio-economic and int^dlectu.il lev 1 (2) scores on items from a 
variety of widely used inlant tests, administcied from 3 months to 5 
years of age, showed little relation to Stanford-Binet IQ's at age 5. In 
another study (9), 138 children who had been tested between the 
ages ot 2 and 6, during the standardization of the 1937 Stanford- 
Binet, were rc-examined ten years ia^er. When scores on the same 
form ol the test were compared, the group initially tested at ages 2 

^^That these changes in "aJiabihty with age may themsehes have an environ 
mental cxpl ination — in teinis of the im teasing and deceasing uniformity of indi- 
vidual e'xpciientes between these aues — is beside the point The fan lemains lhai 
such ehanges oeeiir in the af^entc of anv cxpeiimentallv mlc polaled factor S'’eh .is 
spee al tranng intu>iluced in a.iv paiteiild' longitudinal study. 
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or 3 showed an average change of 13 IQ points; those children 
initially tested at ages 4 or 5 showed a mean change of 1 1 IQ points. 
The author’s conclusion from these findings is typical of the current 
view of most psychologists: although “the Revised Stanford-Binet 
Scale is as good as or better than any other objective index in pre- 
dicting the future intellectual functioning of a preschool child . . . 
an individual IQ obtained prior to the age of six years must be inter- 
preted with discretion.” 

A number of factors are undoubtedly responsible for the low pre- 
dictive value of IQ s obtained at the preschool ages. In the first 
place, the samples upon which the norms were established arc gen- 
erally not so representative as in the case of older groups, owing to 
the practical difficulty of gaining access to young children for testing 
purposes. The ncgativistic behavior characteristic of these ages, which 
may spuriously lower a child’s score on any one testing, has already 
been discussed. It is also likely that “intelligence,” heavily loaded 
as it is in our society with verbal ability, cannot be satisfactorily 
measured until the individual has attained a certain minimum of lin- 
guistic development. In a number of comparisons between the Stan- 
ford-Binct IQ’s of 5-year-olds and their performance on Vefrious 
infant and preschool tests, L. D. Anderson (2) concluded that lan- 
guage development and linguistic items have the greatest picdiclivc 
value. Comparison with later fO\ V’ould probably show the predictive 
advantage of such verbal items to be even larger. 

It should also be noted that, until they reach school age, most 
children have not been exposed to a sutfieienl body of uniform ex- 
perience — later furnished by the relativjy standardized school cur- 
riculum — to permit an adequate sampling of common intelJcctua! 
tashs for testing puiposcs. lhat marked individual changes in IQ do 
not occur haphazardly, but may be related to experiential factors, 
is suggested by a further analysis by Bradway (9) of the lO-ycar 
changes in Stanford-Binet IQ cited above. Fifty subjects showing the 
largest test-retest changes were selected from the total group of 138 
children. Detailed information on these cases was obtained through 
home interviews and visits, from which quantifiable data on thirteen 
home and familial characteristics were derived. A comparison of the 
26 children showing IQ gains with the 24 who showed losse- indi- 
cated that all of the factors studied were related to IQ changes. 
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Highly significant differences between the two groups were found, 
for example, in parental intelligence.^** 

Finally, mention may be made of the possibility that the relative 
“constancy of the IQ” observed at later ages is itself an inevitable 
mathematical consequence of the cumulative nature of behavior de- 
velopmcnt.-^ The individiiars behavior equipment at each age in- 
cludes, in general, all his earlier behavior equipment, plus an incre- 
ment of new acquisitions. Even if the annual increments bear no 
relation to each other, a growing consistency of behavior level would 
appear, simply because earlier acquisitions constitute an increasing 
proportion of total behavior as age increases. Predictions of IQ from 
10 to 16 would thus be more “reliable” than from 3 to 9 because 
the scores at 10 include a larger proportion of what is present at 16, 
while scores at 3 include a smaller proportion of what is present 
at 9. 

It should now be apparent that the “instability” of early IQ’s de^ 
pends only in part upon shortcomings of the measuring instrument. 
Some of the observed instability follows from the characteristics of 
behavior development itself. Predictive value of the IQ over periods 
of more than a year cannot be regarded as synonymous with test 
reliability in the accepted sense (cf. 49, 63). If genuine changes in 
performance level occur during such an interval, the scores on a highly 
reliable insirumcnt will — and should — change. Body weight at the 
age of six months, for example, may correlate very low with body 
weight at age 40. Prediction from the former measure to the latter 
would be hazardous, and yet such measures may have been obtained 
with scales of nearly perfect reliability. Logically, a test may have 
high reliability — and validity — at a ] articular age level, despite the 
fact that it does not permit accurate jong-rangc predictions. From a 
practical viewpoint, such a test would still have good predictive use- 
fulness, i.e., predictions could be made from the specific behavior 
sample of the test to other behavior of the child at that particular 
age level. 

Whether parental intelligence operated as environmental innuciice or through 
some unknown hereditary stu'ctuial characteristic is, of course, not indicated by sulIi 
a study I'he data are cited here only to suggest that the “instability"’ of early IQ’s 
may not be wholly a result of the uMieliability of the measuring instrument, but may 
be dermilcly traceable to intcivcnmg environmental influences. 

^"Cf, J. K. Andfison’s (1) concept of “overlap,” to be discussed more fully in 
the following chapter. 
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The investigator who conducts a longitudinal study involving the 
testing of young children ought certainly to take every precaution to 
insure high test reliability. Thus adequacy of norms, good rapport, 
and the elimination of the effects of negativism arc important pre- 
requisites for unambiguous results. But the remaining “instability of 
early lO’s,” resulting from such factors as insufTicicnt verbal develop- 
ment, lack of highly uniform training, or the sheer paucity of the 
early behavior repertory, ought to be regarded more as an observed 
datum of behavior than as a weakness of the tening procedure. 

Seasonal Variation. Among the results cited by Wellman (60) as 
evidence that the observed changes in ]Q were actually attributable 
to nursery school attendance is the fact that the mean score rose 
regularly in the spring and fell or showed negligible change in the 
fall. The explanation advanced is that during the summer months, 
when the children did not attend nursery school, they were not ex- 
posed to the stimulation which brought about the spring gains in 
score. Seasonal changes in test performance have, however, been 
found in other investigations at the preschool level, as well as at later 
ages. Several studies (20, 21, 32) of non-nursery children of pre- 
school age have show'n that, in these gioups too, a larger gjfm in 
score occurs over the winter than over the summer interval. It thus 
appears that attendance at nursery school was not the factor respon- 
sible for the differential changes during summer and winter months. 

Seasonal differences in traditional activities, such as the shift from 
outdoor and gross sensori-motor activities during w'arm weather 
to indoor games and closer adult contacts during the winter months, 
may affect the child's “rapporf’ in the test situation. An optimum 
“warming-up period” in the type of functions sampled by intelligence 
tests may bring the child to his peak performance level at some time 
in the late fall or early winter, A month-by-month analysis of groups 
taking their initial and final tests in different months, with a six-month 
interval, showed the maximum gains in Movember, December, or 
January (20, 21 ). A considerable number of other seasonal factors, 
including holiday periods, nearness to vacation time, weather con- 
ditions, mounting ennui after a long period of similar activities, and 
the like, may also contribute to these results. 

Whatever the causes, however, the fact that seasonal variations in 
test performance have been observed needs to be taken into consid- 
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eration in interpreting retest results. Changes in performance which 
normally occur at different times of the year cannot be attributed to 
the influence of specially interpolated experimental factors, such as 
nursery school attendance. Moreover, temporary changes resulting 
from reaction sets, warming-up, and similar influences ought to be 
distinguished from more permanent experiential effects. 

IMPLICATIONS OF THE EFFECTS OF SCHOOLING UPON 

TEST PERFORMANCE 

Tn the light of the above survey of methodological problems, it ap- 
pc.ii's that studies on the effects of schooling upon intelligence test 
performance, under present conditions, arc not well suited for an 
analysis of the heredity-environment question. At the nursery school 
level, where the majority of the studies have been conducted, no effect 
of preschool attendance upon 10 has been conclusively demonstrated. 
It is likely that, when various methodological difliculties are elimi- 
nated, a slight effect remains, but this may be the result of improved 
rapport which is higlily specific to the test situation. To find that 
preschool attendance docs not directly improve intellectual functions 
is certainly not surprising. Nursery school curricula were not designed 
for this purpose and have little direct bearing upon the verbal and 
other symbolic behavior functions which constitute so large a part of 
“intelligence.” Their expected effectiveness in “raising the 10” is 
fu^-ther reduced by the fact that, in most of the studies, the children’s 
home environments were already furnishing superior intellectual 
stimulation. 

As for the few investigations on he contribution of subsequent 
schooling, they too represent a relatively ineffective approach to the 
study of environmental influences. The experimental design of such 
studies involves more similarity than diflerence in the experiential 
background of the contrasted groups of subjects. Among the different 
types of schools compared in most studies, the uniformities of instruc- 
tional techniquc'i and facilities seem k. more conspicuous than their 
diversities. Simdlarly, the compulsory education requirements in Amer- 
ica furnish a common core of initial training upon which subsequent 
diflerences in amount of education are superimposed. The elementary 
school years may be of particular importance in the establishment of 
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work habits and in the development of the types of behavior which 
are most prominently sampled by intelligence tests. Similarities in 
home environment and in other out-of-school experiences add to the 
common core. When subjects have so large and so important a part 
of their environment in common, the observable effect of any environ- 
mental differences among them will be diluted. In summary, we may 
say, first, that studies on schooling have not furnished satisfactory, 
conclusive proof of large environmental effects. Secondly, such a find- 
ing is to be expected because of the experimental design of most of 
these studies. The studies on special educational programs set up 
for specific groups represent a more direct and better-controlled ap- 
proach, but the data gathered by this method are still meager, though 
suggestive. 

A further implication of the schooling studies which needs to be 
examined pertains to the relationship between “intelligence tests” and 
“intelligence.” Much confusion has resulted and conflicting asser- 
tions have been made because of unrecognized assumptions regarding 
this relationship. One finds, for example, a tendency for writers on 
both sides of the controversy to “blame the tests,” for opposite rea- 
sons. Thus on the one hand, appears the statement that if the 10 
is showm to be inconstant and subject to modification, then the tests 
must be an unreliable or unsuitable measure of “intelligence.” It has 
been argued (13), for example, that if intelligence tests prove to be 
susceptible to environmental changes, they must be heavily loaded 
with “experience factors” and ought to be revised.-* Such statements 
obviously contain the tacit assumption that “intelligence” is not sus- 
ceptible to environmental influences. Obviously, a procedure which 
sets out to reduce the contribution of a particular factor precludes 
the possibility of studying its influence. 

On the other side, v\c find the proposal that since “intelligence” 
is susceptible to environmentally determined change, less emphasis 
should henceforth be placed upon mental testing in the schools, in 
order to avoid “a harmful and unwarranted label for a child” (,S(), 
p. 536). ft has even been argued (cf. 1 1) that intelligence testing is 
undemocratic! To maintain that, because intellectual functions are 
susceptible to improvement by training, we should stop testing intelli- 

Actually, an effort is generally made to reduce “differential experience factors” 
in the selection of items foi inclusion in intelligence tests. In fact, some writers have 
even suggested that, because of this practice, the tests are loaded in favor of 
heredity! (cf., e.g., 64). 
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gence, is equivalent to suggesting that we stop observing behavior. It 
also seems to identify schooling with the entire reaclional biography 
of the child, ignoring influences outside the school situation. The 
value of psychological tests has been empirically demonstrated in 
many situations One need not believe in the hereditary fixity of the 
IQ in order to employ psychological tests. Such tests enable us to 
gauge what the individual can do in his present state of development 
To expect the tests to predict what the person will be capable of doing 
20 years hence is to demand that the psychologist be a fortuneteller. 
Since we cannot foresee (except roughly) the experiences which any 
particular individual wall have within 20 year*?, we obviously cannot 
predict his behavior \ery accurately over sueh an interval. This is 
particularly true, of course, if the test is given at an early age, when 
only a small fund of behavior has already accumulated. 

Both of the^e extre.ne \iows illustrate a failure to recogni/c the 
proper relationship between the test and the behavior which is being 
tested, Actuall}, every intelligence test is only a sample of behavior, 
a small part ol the type of behavior w'hicb w^e call intellectual. It 
follovvs that any observation made regarding test performance is ipso 
facto an observation regarding behavior. There is no sharp distinction 
between those factors w'hich inllucnce lest performance and those 
whic h iniliience inieiligcnce. I he difleience is one of degree, or of 
breadth of inlluence. Ihus certain factors, such as negativism in test 
>ituations, may be narrowly limited in their area of opeiation, al- 
thcnigh it is dmibttul that any influence is completely restricted to the 
test situation. The develop* eiit of certain woik methods and the 
acquisition of techniques for the symbolical manipulation of materials 
undoubtedly have a broader area of .ipplication. The various con- 
ditions which alTccl lest periormance probably fall into a continuum 
in reference to the breadth c^f their inlluence upon behavior. 

Finally, mention may be made of ihe notion implicit in some dis- 
cussions that “intelligence” is not definable in terms of observable 
behavior, but is a hidden, uiiexplorablc entity or potentiality which 
remains immune to change while bei'a'inr may alter conspicuously. 
Such a concept of an entity which is jiot susceptible to observation 
and about which no statement can therefore be either proved or dis- 
proved has no place in science.-**’ 

For a tuiihcr ciitical eval uiion of the concept of “potenjality,” the reader is 
lefencd io C haptei 2 
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A DISTINCTION HAS FREQUENTLY BEEN MADE bctWCCH development 
through specific practice or training in a given activity and develop- 
ment through maturation or ''growth’* (cf. Ch. 4). Such a distinction 
docs not imply a dichotomy between inherited and acquired be- 
havior. I’hus maturation is not regarded as independent of environ- 
mental stimulation of a general sort, nor is learning necessarily 
considered to be exclusively determined by environmental factors. 
When we speak of growth, wc usually think of a definite sequence of 
developmental stages in the structural characteristics of the indi- 
vidual. As the child grows older, for example, his height increases, his 
bodily proportions arc altered, and many other well-known physical 
modifications occur. Such changes lake place regardless ol the specific 
training which the individual may have had. 

As structures become altered with age, so wc may expect their 
functions to undergo change. With stronger muscles, the older child 
can learn to walk, climb staus, sit up, and perform various other 
tasks much more readily than his younger brother. It is reasonable 
to expect that certain types of activity \ ill in general appear at fairly 
dehnite stages, since they ie([uire a specific degree ol structural de- 
velopment tor their execution. Very intensive training at an earlier 
age may produce almost negligible cITccts when compared with the 
achievements of an older child with only a minimum of training. 

Since such a large share of infant behavior consists in the acquisi- 
tion of motor skills and sensori-motor l - '•dinations, activities which 
are closely linked to structural factors, growth lather than practice 
seems to play the major part in early behavioral development. It is 
quite a different matter, however, to use the concept of growth to 
describe the intellectual and emotional development of the older child. 

265 
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Such a concept has nevertheless been commonly employed in inter- 
preting age changes in mental test performance, and the curves plotted 
to portray these changes have been labeled “mental growth curves.” 
Such growth curves arc diflicult to interpret for many reasons and 
their use has led to much technical controversy. 

It should become apparent in the course of the present chapter 
that the distinction between investigations of “training” and those of 
“mental growth” is a superficial one. It is only for convenience, 
therefore, that the former were discussed in the preceding chapters, 
while the latter have been reserved for the present one. The data 
on both topics should he considered as a whole. A few studies, in 
fact, arc difficult to classify into one or the other category; this is 
especially true of experiments on very young children, such as those 
reported in Chapters 5 and 6. For the purposes of the present discus- 
sion, however, we may regard studies of “growth” as those in which 
mental test progress at successive ages is observed and charted, with 
no attempt to alter the normal course of development. 

THE GROWTH CURVE 

Growth curves were first plotted to show the development of physi- 
cal trails, such as height, weight, bodily proportions as indicated by 
various indices, and the like. An example of such a curve, showing 
the changes in height in groups of tall, average, and short girls be- 
tween the ages of 5 and 17, is given in Figure 48. As a descriptive 
technique for portraying more vividly the course of development of 
structural characteristics, the growth curve has proved serviceable 
and intelligible. The physical dcitr are relatively easy to interpret and 
unambiguous. By analogy, however, attempts have been made to plot 
curves of '“mental growth,” a procedure which has brought additional 
confusion into an already difficult problem. At best these curves arc 
only a descriptive summary of changes produced by a multiplicity 
of factors. By lumping all sucli factors together and giving them a 
semblance of systematic growth, the main issues are often obscured. 

We shall first examine some of the principal methodological prob- 
lems met in the measurement of “mental growth.” A consideration of 
such problems goes far toward explaining the discrepancies and dis- 
agreements among different investigators. Typical findings on age 
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changes in test performance among children as well as adults will be 
discussed in subsequent sections. 



Age m Years 

Fig. 48. Growth Curves in Height. (From Baldwin and Stcchcr, 4, p. 13.) 

Cross-Sectional versus Longitudinal Comparison. Because the re- 
testing of the same individuals year after year is very time-consuming, 
grow til studies ha\ e trecjucnlly employed a cross-sectional approach. 
For example, groups of subjects ranging in age from 12 to 20 are 
tested simultaneously and th'v average score of each age group is 
plotted against age. It is assumed that these averages indicate the 
normal course of development and Fiat they approximate closely 
the scores which would have been obF.ined if, say, (he 12 -year-olds 
had been retested annually until they reached age 20. 

Such an assumption is open to question for at least some of the 
groups which have been te^tcd. The diflVrent age groups may not 
be comparable because of progressive selcrtiv*' faciors. High school 
seniors, for example, are a more highly .select group than high school 
freshmen, since the poorer students tend to drop out in the course 
of their high school work. If, as has often been the case, the subjects 
tested were in school, the higher average score of the older subjects 
may result in part from this selective dropping out of the less able 
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students. Had the same subjects been tested in the freshman and 
senior year of high school, the mean gain in score might thus have 
been much smaller. 

A further objection to a cross-sectional approach is that the 
experiential backgrounds of ihe different age groups may not be com- 
parable. 'Fliis is especially evident when comparisons arc made be- 
l^^een widely disparate age groups. For example, the differences 
between present-day 40-ycar-old'> and present-day 15-ycar-olds can- 
not be attributed entirely to factors associai 'd with age. At the time 
when today's 40-ycar-olds \^ere 15, schooling was poorer, opportuni- 
ties lor certain ol actnity were less frequent or even non- 

existent, and many socia' altitudes were probably qiiile different from 
those current today. Such comparisons are thus complicated by the 
fact tliat older and >our.gci groups were brought up under different 
conditions, owing to general cultural changes which arc constantl) 
occurring.^ 

Paitlv in recognition of these difficulties and parll\ because better 
facilities for growth studies ha\e become available, an increasing use 
of longitudinal studies is now being made. To be sure, the longi- 
tudinal approach aho pre cuts its own peculiar dillieultics. *Vlosl of 
these have alrcad) been discussed in connection with effects of school- 
ing (ef. C'h. 8). Some ol the dilheulties. however, do not apply to 
studies bastd up('n a ‘ingle gDup; others are lemediable if the in- 
vestigator is aware ol them. Peihaps the most serious weakness o** 
longitudinal go^wlli studies is the somewhat select nature ol the 
participating groups, a fact which results from the prerec|uisites of 
stability of residence and continued cooperation with the in\e >tigator. 
At the wwst, however, such selection limits the scope of the results, 
but It doc^ not invalidate them if the population to whkh the) apply 
is clearl} specified. 

Among the most extensive longitudinal growth studies mav be men- 
tioned the Berkeley Growth Stud} at the Institute of rihlj Welfare 
at Berkeley. Cahlornia (2‘M; the several Harvard Growth Studies 
(14, 52); and the ambitious rcseaich progiam of the Samuel S. 
Pels Research Institute at Yellow Springs, Ohio, which is concerned 
with nearly every phase of the individual's development from concep- 

* Cf. eg, llic siiiiufsti\c findiiu^s of Giinilhclj (U)) on the rcl.itionsaip between 
neurotieism and the socioeconomic conditions prevailing at the time when the 
individual rejchcd matuiity. C f. aUo Kiihlcn (29) foi a general chsciissiun of the 
factor ol sovial change in age compaiisuns. 
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tion to maturity (50). Approximately 300 children and their families 
who live in the neighboring communities are the subjects fpr the Pels 
studies. Of the tlarvard Growth ^Studies, the two earlier studies cov- 
ered only certain aspects of physical growth. The third, completed 
in 1938, was based upon a wide variety of physical, psychological, 
and educational measurements of approximately 3500 school children 
in three Massachusetts cities (14). These children were first tested 
upon admission to the first grade and were retested annually for 12 
years. Jn the fourtli Harvard Growth Study (52), initiated in 1930, 
testing was begun at the time of birth. Approximately 100 children 
of each sex are being followed up through periodic examinations 
in thi.^ project. 

Average iwsifs Individual Curves, A further objection lo the 
cross-sectional approach is tliat it permits the plotting of only average 
curves. Since dhferent [)ersons arc tested at each age level, it is 
obviously impossible to chan the progress of individual cases. Even 
when longitudinal data are collected, moreover, the common practice 
is to plot the average score for each age. Such a procedure may con- 
ceal significant variations from individual to individual. If the devel- 
opment ()f any particular function varies markedly among dilfcrent 
individuals, such didcrenccs would probabl> cancel out in the average 
curve, dhc resulting curve might thus be cjuite unlike the actual 
course of development for any individual, 

A clear-cut illusiraiion of the piv vible elTecrs of the indiscriminate 
averaging of individual growth curv'c,. is provided by the findings on 
the prc-pubcrial spurt oj g/ru//i. Ihc individual growth curves for 
many phvsical traits show a spurt or sudden increase in the rate of 
growth shortly before puherty. Since iuJividuals differ in the age at 
which tlicy reach puberty, Niich a spurt of growth would fall on dif- 
ferent portions of tiic growth curve for different individuals. The 
curve based upon guiup averages would therefore reveal no spurt 
at any period, since this phenomenon would be completely masked 
or obscured. When only individuals reaching puberty at the same age 
arc included, how^wer, the pre-puhesv * growth spurt becomes 
clearly evident, as illustrated in Figure 49. 1 his curve, taken from the 
third Harvard Growth Studv, shows the average annual increase in 
height of each of eight groups of girls, classified ciccording to age of 
puberty. The group reaching puberty before 1 1 Vi ycar«, for example, 
has its maximum spurt of gnwth at age 11. At the other extreme, 
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those reaching puberty after \AV 2 show their most rapid growth at 
about 14. 



Age in Years 


Fig. 49. Avcnigc Annuiil IncrcmciUs in St.ind.rig HeiglJ (’jT I-’ighl CYroiips 
of Girls Rcciching Piibcilv al Dillcrcnl Ages. (From Slum. can orth, 4S, 
p. 32.) 

Difficulty Level of (he Test. The form of the growth curve is 
also affected by several characteristics of the test or measuring instru- 
ment employed to gauge the amount of progress. Among such factors 
is the general difficulty level of the test. In a relatively easy task, 
performance will improve rapidly during the hrst few years and 
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more slowly later on as a perfect score is approached. In a relatively 
difficult task, on the other hand, or in a task which requires a certain 
degree of general information or mastery of techniques before it 
can be properly executed, progress will be slow at first and much 
more rapid at the upper age levels. The latter task would thus give 
a positively rather than a negatively accelerated curve. These effects 
can be illustrated by means of performance cur^es of successive 
school grades on easy and difficult sentences in the Trabue Sentence 
Completion Scale (16). The curves for tivc representative sentences 
of different degrees of difficulty are shown in Figure 50. 



School Grade 

Fig. 50. Grade-Progress Curves for Completion of Sentences of Varying 
Difficulty. (From Freeman, 16, p. 336.) 

Test Ceiling and Test Zero. If the difficulty range of the lest is 
narrow, performance may be artiiicially cut off at cither the upper 
or lower end, or at boffi ends. Thus if the '^ceiling' of the test is too 
low for the abilities of the older subjec’^ tested, there will not be suf- 
ficient items at the difficult end of the ..cale to permit thcs(' subjects 
to show improvement. Although the subjects’ ability may actueilly 
increase from, say, 18 to 19 years of age, their scores on such a test 
may show little or no progress, since their ocrformancc is close to 
a perfect score. 1 he effect which this is likely to have on the form 
of the growth curve is illustrated schem; -dly in Figure 51. Although 
the ability which is being measured may continue to increase by equal 
amounts in successive years (i.e., along a straight line), the scores 
begin to taper off as they approach the arbitrary test ceiling and 
will stop rising altogether when the maximum score is reached. An 
equally aitificial slowing down of progress may result at the early 
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ages from the use of a test whose arbitrary zero point is too high for 
the subjects. Thus if a particular test has too few easy items to sample 
the performance of the younger 
subjects adequately, the curve 
will probably rise slowly at 
first and then rapidly, i.c., it 
will be a positively accelerated 
curve. 

Inequalitv of Test Units. The 

scores on most psychological 
tests do not correspond to equal 
units of ability, 'the possible 
effects of such inequalities upon 
the measurement of impro\e- 
ment with practice were dis- 
cussed in Chapter 7. The cffLcts 
upon growth curves are of a 
similar nature. Let us suppose 
that on a certain test the difference in ability required to improve 
from a score of 50 to a score of 51 is considerably greater than 
that rec[uired to progress from 20 to 21. Such a discrepancy would 
tend to make progress appear slower at the later ages, since the 
50-to-5I step is more likely to fall within the performance range 
of the older subjects, and the 2t)-to-21 step within that of the younger 
subjects. 

A special illustration of the influence of test units upon the form 
of the growth curve is furnished by mental age curves. If average 
mental age is plotted against chronological age, the result will arti- 
ficially resemble a straight line. Any divergence from a straight line 
in such a graph simply indicates errors in test standardization. It 
will be recalled (cf. Ch. 2) that age scales arc so constructed that 
the average child will advance one year in mental age for each year of 
chronological age. The successive mental age units are thus adjusted 
so as to rule out automatically any differences in amount of improve- 
ment from year to year. Such units arc therefore unsuited to a study 
of the course of intellectual development. 

Type of Abilities Measured at Different Ages. When a complex 
scale such as the Stanford-Binet is employed, it is likely that dif- 
ferent abilities arc measured at different age levels. At the upper 



Fig. 51. The i:ffecl of a 1 ow Test 
Ceiling Upon the Form of the 
Growth Curve. 
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ages, most intelligence tests are heavily loaded with verbal functions 
and other abstract and symbolical tasks. At the other extreme, infant 
tests are largely based upon sensori-motor development. It is also 
possible that what appears superficially to be a uniform task may call 
different activities into play at different age levels. For example, 
the same form board which measures predominantly spatial percep- 
tion at age 4 may measure chiefly speed of movement at age 10. It 
is apparent, therefore, that any one growth curve may in reality 
consist of several overlapping curves for different functions. 



Fig. 52. Age Changes in Variabilify in Different Functions. (From Bay- 
Icy, 7, p. 59.) 


It is interesting to note in this connection that such shifts from 
one function to another at different ages seem to be accompanied by 
rises and falls in variability, i.c., in the extent of individual dif- 
ferences. The SD of total scores on infant tests, for example, has 
been found to rise sharply to a peak at six months of age; it then 
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drops gradually until the age of one year, after which it rises again 
slowly (7). It has been suggested that such shifts in the extent of 
individual dillerences may parallel the development of separate func- 
tions. dims as the sensori-motor functions approach maturity near 
the end of the first year, individual dillerences in these functions — 
which liad previously been large — show a decrease. At this same time, 
the learning and “adaptive' items, which enter increasingly into the 
total lest performance as the cliild grows older, introduce a second, 
gradual rise in variability. The correctness of tins explanation of the 
observed shifts in variability is further indicated by a separate analy- 
sis of age changes in SD in the sensori-motor and in the adaptive 
items, dhe results ol this analysis are presented in Figure .S2. It will 
be noted that the sensori-motor items show a steep rise up to the age 
of 6 months, followed by a shaip and continued drop, with no subse- 
quent rise. 1 hus as maturity in these behavior functions is approached, 
individual dilTerences gradually disappear. The adaptive functiems, on 
the other hand, show a gradual and continuous rise in variability. 

Composite Nature of Most Growth Curves. Fwen at a single age 
level, the t unctions involved in most f Psychological tests are veined 
and complex. An individuafs score on such a lest generally depends 
upon his abilities in a number of diherciit functions. Fven if essen- 
tially the same lunclions are measured by such scores over the age 
range tested, il is neveitiicless true that the resulting growth curve is 
a composite of several curves, hach of the conlribuling functions 
may develop at a dilferent rate and leaeh '‘maturity’' at a din’erent 
age. To be sure, if the composite is consistently and unambiguously 
defined, age changes in such a compo>ite may be signifie.mt in them- 
selves. The growth curve of height, for example, may be analyzed 
into separate growth curves lor limbs and trunk, wdiieh develop at 
different rales. It is still both practically and scientifically useful, how- 
ever, to measure age changes in total height. But the composite height 
measures of diflcreni investigators liave tlic same composition — a fact 
which is clearly not true of diderent intelligence tests. Many psycho- 
logical tests purporting to be equivalent may thus yield diverse growtli 
curves, because of the varying combinations of functions which enter 
into each test. 

Three illustrations of such composite behavior indices will be con- 
sidered. The first deals with the extent of activity in the human fetus 
at different prenatal ages. Figure 53 shows average results for 16 
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fetuses observed during normal gestation (49). The top curve, indi- 
cating the per cent of time the fetus is active at different ages, resem- 
bles the familiar negatively accelerated growth curve. When, however, 
the total fetal activity is subdivided into the three commonly observed 
types, three very dillcient curves are obtained. Small, rhythmic move- 
ments show little or no increase with age. Kicking shows a sharp rise 



53. “(irowlh Curvc.s" lor 1 ypcs of Moven ent during 

ihenutal tale. (I’rom Sonlug, 49. p. 1.2.) 


from 5 to 3 moiUiis prior to birth, followed by a drop. Squirming 
movements, on the other h:md, increase in tnunency throughout the 
observation f)criod, rising slowly at lir i and more rapidly later. 

T he second illitstratii>n concerns the i.xquency of crying by infants 
during the lirst year of life (b). A record was kept of all instances 
of crying by 61 infants in die course of monthly physical and mental 
examinations. The total figures suggest a general tendency for the 
amount oi crying ( frequency and duration) to decline until about four 
months of age, then to increase again, especially after six months. 
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The amount of crying drops once more beyond six months, but in- 
creases slightly toward the end of the first year. On the surface, such 
a finding might suggest a cyclical development of emotional behavior 
in the infant. The apparent periodicity, however, may result from the 



Age in Months 

Fig. 54. A gc Chiingjs in (Tying Behavior in Response to DilTerenl 
I'ypes of Sliiniiii. (Ihoin Bviviey, 0, p. 320.) 


combined cfTccts of a number of independently varying factors. In the 
present investigation, crying in response to cliflercnt types of stimuli 
yielded age curves differing in both form and direction. Three of 
these curves are reproduced in Figure 54. ll will be noted that “cry- 
ing as a result of restriction of movement and unaccustomed position” 
retains a relatively high frequency throughout the first year, with no 
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consistent downward or upward trend. “Crying from fatigue” shows a 
fairly steady drop from the first to the twelfth month. The reverse 
trend is evident in “crying because of strangeness of persons or 
places,” which mounts steeply throughout the year. The apparent 
periodicity in “emotionality” would thus seem to result from a com- 
bination of many specific emotional responses, each of which follows 
its own independent course of development. Such findings suggest 
that in another investigation a different trend in the composite crying 



Fig. 55. Age Changes in Two Mechanical Ability Icsls. (From Jones 
and Seashoic, 2S. p. 141.) 


curve might be produced by alter! .g the relative frequency of the 
specific stimuli which evoke crying. 

The third illustration is furnished by the scores made by a group 
of adolescent boys in the sfimdardization of the Minnesota Mechanical 
Aptitude Battery (cf. 28). Figure 55 shows the age differences in 
average standard scores on two of the tests in this battery, the Spatial 
Relations and the Assembly tests U .dl be noted that the curve for 
Spatial Relations exhibits a definite negative acceleration, rising 
more sharply until about age 15 and then slowing down. The curve 
for the A'^sembly test, on the other hand, follows almost a straight- 
line trend, with minor fluctuations. 
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Similar examples could be cited from the various phases of linguis- 
tic growth, age changes in different types of memory tests, and the 
development of many other functions. It should be evident that the 
so-called curve of mental growth is not one, but many curves. A few 
of these curves run parallel, others move along simultaneously but 
at different rates, while still others succeed one another in over- 
lapping steps. 

Age Progress Curves. When applied to psychological test scores 
and other behavior data, the term “growth curve” may be quite mis- 
leading. What such a curve actually shows is the performance of the 
individual at different ages in some standard test situation. Such a 
curve docs not differ in any essential respect from a learning curve. 
In both cases, the subject is tested under similar conditions at suc- 
cessive intervals and his progress is charted on the curve. Learning 
curves, to be sure, usuall}/ cover a shorter period of time than growth 
curves, although a practice experiment could conceivably extend over 
several years. Lhe major dilference between learning curves and 
growth curves seems to be that in the former the subject is given 
special training under rigidl>' controlled experimental conditions, while 
in the latter he is left to his own resources, d'luis it would sticm that 
a psychological growth curve is at best a practice curve obtained in 
the absetice of controlled conditions.^’ It reflects the cumulative effects 
of the random training and experience of everyday life, without add- 
ing anything cssculially new to the picture. 

It follows from this discussion that growth cur^cs may vary with 
the cultural milieu in which they <ire obtained. If the lu:irning con- 
ditions differ from one gremp to amuher, the curves of psychological 
growtii may likewise be expected lO differ. Such “growth cur\cs'* can 
still serve a useful purpose as desciiptive devices. y\s such they nmy 
indicate the general course of development of different functions under 
given cultural ccmditiofis, and would charactcri/c mdividuals of dif- 
ferent age levels within a specific group. For such curves, the term 
“age progress curve” would seem a more accurate designation than 
‘'growdh curve,” since it provides a more realistic description of the 
type of data from which the curves arc derived, 

-This view is to be contraacd with that of Coartis fl2, ]'’»), who regards gro\\th 
as development under anistaut enviioninentai cont'aioiis. Although C'ouiMs also ccni- 
siders piactice and giovvih etiives as furHlameiilaljy similar, he subsumes practice 
curves under the heading of growth emves, rather than vice vcisa. It would appear 
more realistic, however, to legard growth eiuves as a type of piacticc curve. 
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TYPICAL FINDINGS ON Till-: IMPROVEMENT OF MENTAL 
TEST PERFORMANCE WITH AGE 

In view of the- many diniculties enumerated in tlic preceding sec- 
tion, the reader may wonder what, if anything, can be learned from an 
examination of the age curves themselves. Idrsl, it is clear that such 
curves show changes only in the particular area of behavior which the 



Fig. 56. Age Changes in Staiiford-hinct Performance Expressed in Abso- 
lute Scale Units. (1 roni Thu. ame and Ackeison, 02, p. 57h.) 


test samples. Secondly, the results emnot be assumed to hold for in- 
clividuals whose experiential background is markedly unlike that of 
the group on whom the curve was derived. Ilnrdly, the units in which 
test scores ate expressed and plotted present a persistent problem, to 
which various alternative solutions have been proposed. Certainly, 
some type of equal-unit score would seem io permit a more intel- 
ligible picture of annual progress than is furnished by unequal raw 
scores or by anificiall> adjusted menial age units. With these points 
in mind, we may examine the ret.uits of some of the most carefully 
conducted investigations. 

Perhaps the most widely quoted curve is tliat prepared by Tliurs- 
tone and Ackerson (62) from the Stanford-Binet scores of 4208 sub- 
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jects between the ages of 3 and 18. The scores were first converted 
into an equal-unit scale.*"* The resulting curve is reproduced in Figure 
56. It will be noted that this curve rises slowly at first, then more 
rapidly, and then slowly again as the final leveling-off is approached. 
Thus the curve is described as positively accelerated in its early stages 
and negatively accelerated later on. The portion of the curve extend- 



Age in Months 


Fig. 57. Age Changes in Test Perl ormance during Infancy, Expressed 
in Absolute Scale Scores. (From Ba\ lcv, 7, p. 4^.) 

ing below three years of age was found by extrapolation. Some con- 
firmation of its shape, however, was provided by the data of Baylcy 
(7), which were plotted by Thurstone in the same equal-unit scale. 
This curve, based upon retests of infants in the Berkeley Growth 
Study from shortly after birth to age 3, is shown in Figure 57. It will 
be seen that the curve shows positive acceleration, especially during 
the first seven or eight months of life. 

A curve similar in general form to that of T hurstone and Ackerson 
was found in the third Harvard Growth Study, through retests of 522 
children between the ages of 8 and 15 (cf. Fig. 58). The findings of 
the earlier cross-sectional study arc thus confirmed by a longitudinal 
approach. In another longitudinal study, Freeman and Flory (17) 

® “Absolute scale” units, whose derivation can be found in T hurstone and Acker- 
son (62). 
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retested 469 children annually with a battery of four tests. The testing 
began at age 8 and extended over a ten-year period, more than half 
of the subjects receiving five or more successive retests. The test bat- 
tery (designated VACO) consisted of vocabulary, analogies, comple- 
tion, and opposites. The composite curve showed an almost linear, 
or uniform, increase from ages 8 to 15, with only a slight decline in 
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Fig, 58. Age Changes in Intelligence Test Scores in a Group of 522 
Children Rclc-aed between the Ages of Eight and Fitleen. (F"rom Dear- 
born and Rothney, 14, p. 215.) 

rate beyond that age. Considerable progress was noted even at the 
highest ages tested. Pintner and Stan^m (44) administered the CAVD 
cxaminalion annually to 140 children in grades I to VIII, each child 
being tested over a period of from two to six years. This test consists 
of four parts — completion, arithmetic, vocabulary, and directions — 
which recur at each of the dilficulty levels. A particular advantage of 
the CAVD for such a study is that its units progress by steps of equal 
difficulty. The average progress cur o obtained in this study showed 
some negative acceleration, thus confirming the usual results for these 
age ranges. The mean annual gain dropped from 1.23 between the 
ages of 7 and 8 to .60 between the ages of 13 and 14. 

In conclusion, it may be noted that when short periods of five years 
or so are studied, especially during middle childhood, the age progress 
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curve for most “intelligence” tests appears to be approximately a 
straight line. Negative acceleration, or slowing down, sets in as the 
leveling-off point is approached. In other words, cessation of progress 
occurs gradually rather than suddenly. As for the lower end of the 
curve, in infancy and early childhood, its form is still rather uncer- 
tain, owing to insufficient empirical data. Finally, it should be added 
that many investigators have found marked imJiviclifal differences in 
the form of the “growth curve.” Cycles of slow or rapid development, 
which may cover several years, occur in many individual curves, with 
no regularity from person to person.** Evidently ihe detailed course 
of beha\ior development is inlluenccd by many factors — structural 
and experiential — which vary with the individual. 

ADUI T INTI LI IGENCL 

The study of maturitv and old age is a very recent but rapidly grow- 
ing branch ot psychology. Such interest in the characteristics and 
the problems ot older persons has taken many forms. A number ol 
research projects on fairly large groups of adults have been concerned 
with changes in intelligence, special aptitudes, or cmcTionat charac- 
teristics. Special efforts have been made to study representative sam- 
ples at the various age levels, in contrast to the rather atypical groujis 
in the earlier studies on older persons. An increasing concern with the 
vocational and personal guidance of older persons and with the clinical 
treatment of maladjustments at these age levels is also noticeable. 
Several books, round-table discussions, and even a special division of 
the American Psychological Association devoted to problems of ma- 
turity and old age furnish further evidence of the status of this area in 
contemporary psychology (cf. 27, 30, 31, 32, 42). Mention may also 
be made of the construction of intelligence tests specially designed 
for adult groups (20, 63). 

Limit of ^‘Mental Growth.” The more recent and better con- 
trolled studies have generaPy found that intelligence test performance 
tends to improve until the very late teens or early twenties. Some 
evidence on this question is furnished by test norms, although when 
the tests are standardized on school populations, selective elimina- 

'^Cf, eg., Baylcy (8), Dear bom and Rothney (14), Freeman and Hoiy (17), 
and Jones and Conrad (26). 
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tion is likely to make the norms spuriously high at the upper ages. 
In the Terman-McNemar Test of Mental Ability (55), the norms 
were corrected for such selection and can therefore be regarded as 
more nearly representative of the performance of comparable groups 
at each age. The average annual gain in standard score on this test 
dropped gradually from 7 points between the ages of 1 0 and 1 1 to 
3 points between the ages of 18 and 19. It is evident that, although 
the annual increments tend to diminish, performance is still improv- 
ing at age 19. 

Other cross-sectional comparisons were made by Teagarden (54) 
on an orphanage sampling and by Jones and Conrad (25) on the 
entire population of certain New England villages. The results of 
these two studies also show gains in test scores up to age 18 and 
probably even later. In a survey by Miles and Miles (41 ) on groups 
ranging in age from 7 to 94, the Otis Intelligence Test scores con- 
tinued to rise until about age 20. Thi?> w;is also the approximate age 
at which improvement ceased in the standardization data of the 
Wechslcr-Bellevuc Intelligence Scale (63). 

Longitudinal studies on the same individuals have likewise fur- 
nisned evidence that intelligence test scores continue to improve 
until the age of about 20 (cf. 17, 56). In one ol these studies (17), 
some of the subjects who had been first tested at nine years of age 
v/erc followed into college and were still making sigjiificant gains at 
the termination of the survey. In this connection may also be cited 
the retest studies on high school and college students with parallel 
forms of the Psychological Examination of the American Council on 
Education (5, 23, 37, 61 ). Large and consistent gains upon retesting 
were found in all these studies, not only in average score, but also 
for nearly every individual. Witht 't the use of non-college control 
groups, it is of course impossible .o determine ihc extent to which 
such gains are attributable to college training and to other more 
general conditions. From a purely descriptive viewpoint, however, 
the fact remains that gains in intelligence mst performance were made 
consistently by these subjects, who ranged in age from the middle 
teens to the late twenties. There 's evidence, moreover, that 

individuals who continue their education longer tend to improve in 
intelligence test performance until a later age. 

Course and Amount of Decline. Closely related to the question of 
'dntellectual maturity*’ is that of the subsequent decline of abilities 
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with increasing age. A few investigations have been specifically directed 
to this problem. In one of the first systematic surveys of the intel- 
ligence of older persons, Jones and Conrad (25) gave the Army 
Alpha to 1191 persons between the ages of 10 and 60, constituting 
nearly the entire population of 19 villages in rural New England. 
Miles and Miles (41 ), also in the effort to obtain roughly comparable 
samplings at different age levels, obtained adult subjects through 
lodges and social groups. Their total sample consisted of 823 subjects 
ranging in age from 7 to 94, all of whom were given a shortened 
form of the Otis Self-Administering Intelligeticc Test. In the process 
of standardizing the Wechslcr-Bcllcvue Intelligence Scale, Wechslcr 
(63) obtained scores from 670 children and 1081 adults ranging up 
to 69 years of age. All the subjects in this survey lived in New York 
State, the adults being selected so that the occupational distribution 
for each age level resembled roughly the corresponding distribution 
in the national census data. 

The age curves obtained in these three investigations arc given 
in Figure 59, each curve showing the general trend of average scores 
with age. In order to make the data of the three investigations com- 
parable, scores on all three tests were first converted into standard 
scores (cf. 27). It will be noted that rate of decline is ste*cper in 
the Wechslcr and in the Miles and Miles studies than in that of Jones 
and Conrad. This difference may be in part the result of sampling 
irregularities at the upper ages. Differences among the tests employed 
in the three studies, however, are probably a major factor. 

The consideration of these average trends must be qualified by 
the fact that indii'idual differences were large at all ages, with exten- 
sive overlapping of different age groups. Variability, in fact, tended 
to increase with age, despite the decline in average scores. Such 
increases in variability are especially noteworthy in view^ of the pos- 
sible effect of selection in the older age groups. Thus among the 
oldest groups, more individuals pleaded exemption from the tests be- 
cause of failing eyesight, reading difficulties, and the like. Moreover, 
the less energetic and possibly less intelligent older persons were less 
likely to come to the centers where the tests were given. In the Jone.S“ 
Conrad study, for example, a sharper age decline was found among 
the subjects tested in their own homes than among those tested in 
community centers. These selective factors would tend U) reduce 
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variability at the upper ages, by elimination of individuals at the 
lower end of the distribution. Hence if all subjects had been tested, 
the obtained rise in variability would probably have been still greater. 
It would seem that, as the individual’s experiential background is 
enriched with age, more sources of variation in behavior are intro- 
duced and individual differences continue to increase. 



Fig. 59. Intelligence Test Score in Relation to Age: A Comparison of 
Re.ulls from I'hrce Investigations. (From Jones and Kaplan, 27, p. 72.) 


The overlapping of ilie various age groups is such that individual 
differences within any one age level arc much larger- than the dif- 
ferences between age groups. Thus the brightest persons in even the 
oldest groups tested were still conspicuously better than the dullest 
persons in the younger groups Furtf er corroboration of such a finding 
is provided by a study on a small sampling of persons of uniformly 
high educational level (53 > In a comparison of the psychological 
tcist performance of 45 university professors, aged 60 to 80, with a 
comparable group of 45 academic men between the ages of 25 and 
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35, individual differences were again found to be much more impres- 
sive than age differences. 

That age in itself is a poor guide to ability level is further illus- 
trated in Figure 60, which is based upon an analysis of the Miles 
and Miles data. The adult subjects were classified into four levels 



Fig. 60. Age Changes in Intelligence Test Scores at D.irercnt I'diica- 
tional Levels. (From Miles and Miles, 41, p 70. j 

in terms of the ariKHint of formal education they had received. Idic 
highest lc\el (A) consisted of college graduates who had rccci^'ed 
additional professiouiil or graduate training; the lovv^est level (D) 
extended from a total lack of formal education to elementary school 
graduation. Although all four groups show a decline in mean Otis 
score with age, the four curves neither cross nor meet. In other words, 
the higher educational groups retain their superiority consistently at 
all ages. It should also be noted that the lowest point on curve A, 
reached by the 70-ycar group, is still higher than the highest points 
of curves C or D. Thus a 7()-ycar-old person who had pursued at least 
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one year of graduate work would be expected to score higher than a 
20-ycar-old elementary school graduate. 

One of the most significant findings of adult testing has been the 
specificity of age changes. The curve of dec) me, even more than the 
curve of growth, varies with the type of ability measured. Some abili- 
ties increase with age, some decline, others sh(;w liltle trend in cither 
direction (cf., e.g., 51). An illustration of this fact is to be found 



Age m Yedrs 


Fig. 61. Decline oT Abiliiy in DilTcrcnl Funcuons. (From Jones and 
C onrad, 25, p. 250.) 

in the Jones-Conrad study. When scores on each of the sub-tests of 
the Army Alpha were plotted against age, several dissimilar curves 
were obtained. Three of the.^e curves, for the tests of general informa 
tion, arithmetic reasoning, and verbal analogies, respectively, are le- 
produced in Figure 61. The con tin rise in general information 
beyond age 20 is typical of findings by other investigators, as is the 
slight decline in arithmetic reasoning. The steeper drop in verbal 
analogies may result in part from the relative prominence of speed 
in this particular sub-test and in part from the nature of the test 
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itself.''* In another investigation (21), a vocabulary and three per- 
formance tests were administered to 375 men and 268 women of 
low socio-economic level, ranging in age from 15 to 76. The vocabu- 
lary test showed a rapid rise with age from 15 to 20, and slower rises 
up to 55, when a slight drop was found. The scores on the three 
performance tests, on the other hand, declined rapidly at the older 
ages. The greater stability of vocabulary tests among older persons, 
as contrasted to the rapid decline on many other tests, has been 
corroborated by a number of investigators (cf. 3, 47, 64). 

Tests which emphasize speed have regularly shown a particularly 
steep decline with age. This finding has been reported consistently 
in investigations in which the performance of younger and older 
adults was compared on power as well as on speed tests (cf. 32, 39, 
42). For example, when subjects ranging in age from 15 to 75 were 
given the Otis Self-Adminislcring Intelligence Examination by a 
work-limit rather than a time-limit method, the younger adults re- 
quired less time to complete the test, but the accuracy scores showed 
virtually no relation to age ( 1 1 ). In another study (38), three groups 
of subjects, aged 20 to 25, 21 V 2 to 37 V 2 , and 40 to over 70, respec- 
tively, vv'crc matched in CAV^D scores. It will be recalled tlmt this is 
a pure power test, given with practically no time limit. When the 
three equated groups (totaling 143 cases) were tested with the Army 
Alpha and the Otis, the average scores on these tests dropped pro- 
gressively from the youngest to the oldest group. Speed plays an im- 
portant part in performance on the latter two tests. 

A further point to note in evaluating the age decrements in mental 
test scores is that nearly all our available tests have been conslrucied 
on young people. Asa result, they are overloaded with tasks typical 
of the activities of young persons (cf. 20, 42). The content might be 
quite dilTcrcnt if the tests had been constructed by sampling the 
activities of older persons. It is likely, therefore, that most available 
tests tend to favor young persons unduly. The fact, for example, that 
most intelligence tests arc so heavily v/eighted with academic content 
reflects the school activities of the young subjects on whom they were 

^ Tt will be noted in Fipurc 61 that the general level of scores in the thret. tests 
differs, despite the use of T’-scorcs in plotting all these curves. This results fjom the 
fact that the T-scaJing was done on the basis of the scores of a sub-saniplc of 200 
adults aged 25-39, rather than on the basis of the cntiie sampling. 

^ Knox Cubes, Porteus Mazes, and Ferguson Form Boaids. 
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standardized. The inferior performance of older groups on such tests 
may result partly from the greater distance in time of the older per- 
son from his own formal education. The loss would thus be more a 
matter of forgetting and lack of practice than one of dctcrioriation. 
Moreover, in a number of comparisons in which level of education 
w\as not held constant, it is probable that the older persons had re- 
ceived less education than the younger, since the general level of 
education has risen considerably during the past two or three gen- 
erations. 

Adult Learning. 1 he notion that “you cannot teach an old dog 
new tricks” is a common one in popular thinking. Adults frequently 
deplore their inability to learn a new language, a new motor skill, or 
an improved work method as well as they could in their younger days. 
Closer observation reveals, however, that the condhious of learning 
are far from cumpaiable at dilTcrent age levels. The time available 
for learning, the distractions, and the motivation for learning are 
often very different for the child and the adult. The learning of new 
skills is frequently undertaken casually and halfheartedly by the 
adult, while for the child or adolescent it is the core of his serious 
responsibilities, other activities being “extracurricular.” 

When older and younger persons learn under comparable con- 
ditions in an experimental situation, the differences in iheir perform- 
ance are relatively slight. In a scries of investigations with a variety 
of tasks (57). an average decline of less than 1% a year in “sheer 
modifiability” was found between the ages of 22 and 42. This decline 
was manifested principal!} •- the more meaningless tests of rote learn- 
ing, such as drawing lines of given lengths blindfolded, learning a 
code, or memorizing numbers paired with nonsense syllables. In most 
other tasks, the older persons could ^'ompensate for any loss in learn- 
ing ability by greater inlercst, better sustained effort, and a larger fund 
of relevant experience. Yor example, in stenography and typewriting, 
in learning Es[)eranlo, or in universily courses, the progress of the 
older persons equalled and sometimes e^en excelled that of the 
younger. 

When the new learning runs counter to previous learning, it is 
reasonable to expect older adults to be handicapped. This is a simple 
result of inicrjcrence, oi “negative transfer,” which has no necessary 
connection with age as such. There is some experimental evidence 
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(46) lo show that tasks which are hindered by previous experience 
suffer a greater age decrement than those which arc benefited by 
such experience. In a learning experiment involving the comparison 
of three different age groups, the older subjects were found to be 
inferior to the younger subjects in the learning of all types of material, 
although they were lcs.> inferior on the more meaningful material. 
For example, the older subjects did relatively best in learning paired 
associates which were meaningfully connected, such as nest-owl, soft- 
chair. Their performance was poorer in learning “nonsense” material, 
such as A X M B or N X M^C, and poorest in memorizing ma- 
terial which conllicted with previous learning, such as 3 4 2 or 

3 X 1 1. Some of the attitudes and emotional reactions charac- 

teristic of “old age" may have a similar explanation (30). The greater 
“conservatism” commonly attributed to older persons may simply 
mean that the longer one has held a certain opinion, in general, (he 
more firmly fixed it becomes. Older persons, for instance, were found 
to be less susceptible than younger persons to suggestion from either 
group opinion or the opinion of experts (40). Such reactions may be 
explicable in terms of the cumulative effects ot previous experience, 
without resort to unknown physiological bases. Through all^this con- 
sideration of adult learning, we must not, m(3rcovcr, lose sight of the 
wide individual differences and extensive overlapping ot age groups, 
as conspicuous in this area as in any other age comparison. 

The Age of “Maximum Productivity.” Another approach to the 
study of adult abilities has been through an analysis of productive or 
creati\'c work in such fields as science, literature, and art (33, 34, 35). 
To be sure, selected cases can be found to illustrate maximum pro- 
ductivity at almost any adult age in individual scientists, inventors, 
writers, musicians, or artists. In terms of group trends, however, fairly 
consistent age curves have been found. In Figure 62, for example, is 
shown the average number of “best books” by 101 noted writers dur- 
ing successive five-year periods of their lives. Consistently similar 
curves were obtained when the lists of “best books” prepared by 
different accepted authorities were consulted. It will be noted that the 
most “productive” years for such writers fall between the ages of 40 
and 50. Similar surveys among scientists and inventors, on the other 
hand, show^cd maximum productivity to occur between 25 and 40, 
with a subsequent dropping off in later years. The specificity of such 



Age Differences 291 

trends is further illustrated by the finding that the peak of production 
among musicians differs with the type of music. Similarly, among 
writers the peak occurs earlier for poets and later for writers of his- 
torical, critical, philosophical, or scientific works. 

Although of interest in themselves, such data on productivity do 
not tell us very much about the rise and fall of abilities in general. In 
the first place, the subjects are certainly a highly selected group and 
not typical of the general population. 1’hc possibility that the age of 
maximum production docs not coincide with the age of qualitatively 
best production lor each individual rnusl also be taken into account. 
It may well be that at certain ages quantity is sometimes sacrificed for 
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Fig. 62. Ago Changes in Liierary Production. (From Lehman, 34, p. 66.) 


the sake of quality in creative work. Thus in a survey of the age at 
wliich over 4000 scientists produce 1 their ‘'chief work,” the median 
age was found to be 43 years t I ). 1 his is considerably older than the 
“age of maximum productivity” reported above for scientists. 

A possible decrease in motivaliou because of financial and profes- 
sional security, development of other inlere.sls, and the like, may also 
affect productivity among some older persons. Closely related to this 
factor is the commonly noted increa'<;c in administrative duties with 
increasing age, especially among acads-iuic persons (cL, e.g , 9). Such 
responsibilities sometimes seriously interfere with “creative activi- 
ties.” Finally, the results of such surveys may be specific to the par- 
ticuliir historical period covered and may vary as social conditions 
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vary. A recent analysis has shown, for example, that present-day 
leaders in a number of fields are definitely older than were their prede- 
cessors who held the same nominal positions (36). 

THE CONSTANCY OF THE IQ 

The widely debated question of the constancy of the IQ at different 
ages can be better clarified if it is regarded as two separate questions. 
The first is the purely empirical, practical, “actuarial” question of 
prediction. It is generally recognized that the intellectually gifted 
school child is likely to develop into a superior adult, and that a 
feebleminded child will probably fall below average as an adult. Just 
how accurately can such predictions be made, and how early in the 
individual's life? These arc the practical questions of prediction, con- 
cerned only with observed trends and regularities. The second ques- 
tion is a theoretical one, in which the degree of constancy of the IQ 
is considered as an index of the regidarity of mental development. W’e 
shall sec that the answer to the first question does not necessarily 
imply a corresponding answer to the second. 

Empirically, the IQ has been found to remain sufficiently constant 
during the elementary school years to make prediction over several 
years feasible. Among older subjects, intellectual level likewise shows 
considerable stability, especially when individuals remain in fairly 
constant environment^'-. Thus the intelligence test scores of college 
students correlate very highly with the scores obtained by the same 
individuals in high school or even in the upper elementary school 
grades (15, 60). In one study, for example, a correlation of .80 was 
found between scores on the American Council Psychological Exami- 
nation administered at college entrance and intelligence test scores 
obtained as early as the seventh grade of elementary school (15, 
p. 476). It is of the utmost importance, in interpreting such results, to 
realize that only subjects who had continued their education to the 
college level were included. If the investigators had worked from the 
other end, by following up a group of elementary school graduates 
and retesting them after five or six years, the correlations would prob- 
ably have been much lower, since the intervening educational and 
other experiences of the subjects would have undoubtedly varied much 
more widely. In a group with comparatively constant educational 
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experiences, however, individuals tend to maintain the same relative 
position in intelligence test score over a period of many years.^ 

In general, there are two major exceptions to the constancy of the 
10. First, large shifts in IQ may occur among individuals who have 
undergone fairly drastic environmental changes, such as placement in 
a foster home or participation in a specially designed and intensive 
remedial program (cf. Ch. 8). Secondly, preschool tests have proved 
to be of little or no value in predicting lO's in adoic^cence and adult- 
hood. 

Evidence for the latter finding is plentiful. For example, in a group 
of 123 children, performance on the Gescll schedule at 6 months 
correlated only .37 with Meirill-Palmcr scores at age 2 (43). In the 
same group, a correlation of .46 was found between the initial Gescll 
test and the Stanlord-Binct IQ at age 3. In the course ol the Berkeley 
Growth Study, 61 childien were retested regularly from the age of 
one month to 9 jears. From an analysis of their scores, Bayley (8) 
concluded that available intelligence tests for infants and young chil- 
dren cannot be used to predict later ability. Tests given at age 4 may 
permit grade school predictions within wide classifications; tests 
between 2 and 4 will predict 8- or 9-year performance with some suc- 
cess; but scores obtained before 18 months of age arc completely 
useless in the prediction (T ahililics during school ages (8). 

In another study of 252 children participating in the Berkeley sur- 
vey, Honzik (22) likewise found little prediction pi'ssiblc from early 
tests. Initial tests made at 21 months coi related onl> about .30 with 
retests at 5 and 6 years of a^e. vSoniewhat higher predictive value was 
showm by tests in the upper preschool ages, but the correlations were 
still too low for indoidiial estimates. Essentially the same conclusion 
was reached by Goodenough and M .urcr (18) in follow-ups of over 
200 children who had taken the Minnesota Preschool Test before the 
age of 6. Correlations of these initial scores with Stanford-Binet 
retests at ages 7 Ic^ 12 ranged from .15 to .45. Correlations on smaller 
groups who were lollowed into college were also reported. The corre 
lations between preschool tests and scores on die \.C.E. examination 

'' The fact that collccc students leprcscnt d highl> scleci group intellectudlly, and 
aie therefore moic homocencous in mtelhgence lest scoics thcin a i.indom sampling 
of the general population, would tend to hmet the correlation between initial and 
tenninal scores in a college group. The high correlations actually obtained thus 
indicate even moie vividly the ellect of the continued umformitics in these subjects* 
cdiiLalioi)<il ( ^ pcnemes 
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taken upon college entrance were .12 (with tests taken under age 4), 
.29 (with tests taken between ages 4 and 5), and .39 (with tests be- 
tween ages 5 and 6). 

Several investigators agree that the low predictive value of infant 
and preschool tests cannot be attributed entirely to the unreliability 
of such tests, since reliability coehicients found within short periods 
are often quite high. Moreover, high and low scores tend to occur in 
clusters within any one individual’s successive retests, and thus seem 
to indicate periods of lag or spurt which may extend over several 
years. At least two other explanations have been suggested. First, the 
individual’s development may be more susceptible to environmental 
influences at early ages. Secondly, dilTcrcnt types or combinations of 
functions may be covered by preschool and by subsequent '‘inlclli- 
gcncc” tests. Some evidence for the latter explanation has alreadv been 
cited in earlier sections. It is probable that both factors contribute to 
the low predictive value of early tests.'^ 

Retest correlations also depend upon the iiilci vaJ hctn'cen retests. 
In other words, the interval over which predictions are made alTects 
the accuracy of the prediction. This relationship is clearly demon- 
strated in an analysis conducted by R. L. Thorndike (5S) with pie- 
vioiisly published data on school-age children. By combming llie 
results from those studies with fairly uniform test-retest intervals and 
then fitting a curve to these data, Thoindike obtained an equation 
showing the relationship between time interval and expected correla- 
tion. On this basis he estimated, for e'<.ample, that the test-retest coric- 
lation is .90 for an unmediate retesi, but drops to .70 over a five-year 
interval. The correlations empirically obtained with scIkhiI chilJien b\ 
subsequent investigiitors have in general corroborated the values pre- 
dicted from this curv e (cf.. 10, 59), 

To recapitulate, the predictive value or consistency of intelligence 
test scores increases with the at vvhicli the lest is administered, and 
decreases as the interval between test and retest increases.-^ Both of 
these relationships can be explained on the basis of the "‘(^vcrUify' of 

^ An adcluional explMnalion, in terms of “ovcijap,” will be discussed below, 
ft should be noted ilul the c/lcct of length of test-/elest inlcfv.il up(»n constancy 
of the IQ will itscll very with .lye At the oldci aucs, the same lime inteival is 
accompanied by a much sniaJlei change in test performance, and even lelalivcly long 
intervals are 1:1 ciy to yield fanlv stable results. Thus in the case of tests administered 
to college students, correlations wjih eail;ci tests taken during the fieshmaii year of 
high school arc about as high as lho.se with tests taken duiing the senior ye»i ol 
h gh school (60). 
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abilities at successive age levels (cf. 2). The performance of the older 
individual is based in part upon his retention of abilities which he 
manifested at earlier ages. The older tlie individual, the greater the 
proportion of such overlap between present and earlier performance. 
Let us consider a simplified, schematic illustration. A child’s 10 at 
age 3 may be determined by his successlul completion of 10 items 
which he was also able to pass at age 2, plus 5 additional items; in this 
case the overlap of his performance at ages 2 and 3 is 10/15 or 67%. 
On the other hand, at age 16, this child’s 10 may depend upon 48 
items which he was also able to pass at age 15, plus two new ones; 
now the overlap is 48 '50 or 96%. 

In his presentation of this “overlap” hypothesis, J. L. Anderson (2) 
has summarized the relationship as follows; 

We deal here with a phenomenon in which the prediction of final status 
is based upon a iaracr .aid lari’cr proportion of that which is inv.luded in 
the total; that is, scores at 10 years include more of that which is present 
at 16 years than do scores at 3 years. . . . Since the growing individual 
does not lose what he already haj», the constancy of the IQ is in large 
measure a matler of the part-whole or overlap relation (2, pp. 388-394). 

in support of this hypothesis, Anderson computed a scries of corre- 
lations between initial and terminal “scores” obtained wdth shuffled 
cards and random numbers. I'hese correlations, winch depended 
solely upon the extent of overlap between successive measures, agreed 
closely with test-retest correlations in intelligence lest scores found in 
three published longitudinal studies. In fact, the test scores tended to 
give somewhat lower correlaiions, a diflcrencc attributed by Anderson 
to such fact<irs as errors of measurement and change in test content 
with age. 

Further corroboration of this explanation of the constancy of the 
IQ in terms of overlap is furnished by an analysis by Rolf (45). 
Using previously published data, RofT correlated the intelligence test 
performance of children at any one specific age v^'ilh their gain in 
performance after one or more years. These correlations were all 
close to zero. From such a finding, r' . author concludes that “the 
so-called ‘constancy of the 10’ is due primarily to the retention by 
each child of the skills and knowledge which determined his scores in 
earlier years, and is not due at all to correlation between earlier scores 
and later gains or increments” (45, p. 385). These findings provide 
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an answer to the second question regarding the constancy of the 
IQ, viz., does the empirically observed constancy of intellectual status 
signify regularity of mental development! The answer now appears to 
be clearly “No.” The growing individual exhibits an increasing con- 
sistency of ability level, not because the “rate of growth” is constant, 
but because his present accomplishments constitute an ever increasing 
portion of his future accomplishment as he grows older. This is tanta- 
mount to saying that at age 1 5 we can make a more accurate predic- 
tion of an individual’s subsequent behavior than at age 2, because we 
know more about him at 15. The proportional change in his behavior 
from age 15 to 1 6 is less than from age 2 to 3, and certainly much less 
than from 2 to 16. 

TRAINING AND GROWTH 

We may now attempt to synthesize the findings of the various investi- 
gations and to evaluate them in the light of the studies on training 
discussed in earlier chapters. If we think of mental development in 
terms of learning, the diverse findings both on the upper limit of 
mental growth and on the decline of ability can be fitted into an intelli- 
gible pattern. It might be objected that the learning curve sliows no 
decline, whereas age curves do. This apparent inconsistency results, 
however, from an incomplete statement of the situation. The problem 
will be considerably clarified if wc speak of age changes in specific 
tasks, as we do in the case of learning, rather than discussing mental 
development in general. It is quite true that the cumulative effects of 
learning in everyday life will increase proficiency indefinitely in cer- 
tain tasks, but such learning will just as surely interfere with the 
performance of other tasks. If the general effect of any specific act of 
learning upon all the individuafs behavior is considered, it becomes 
apparent that learning may cause a decline as well as a rise in achieve- 
ment. 

The decline in performance on most psychological tests with age is 
no longer surprising when w^e realize the resemblance of all such tests 
to school work. We should therefore expect that the longer the indi- 
vidual has been out of school, the more chance he has had to forget 
what he learned as a child, through interference from other activities. 

Although in his everyday life the adult may be employing much 
that he learned in school, he is at the same time losing many school 



Age Differences 297 

habits, such as working with a specific time limit, following directions 
literally although he may see little sense in them, and working with 
materials which may be meaningless and of no apparent use to him. 
When a school child is confronted with a psychological test, the nov- 
elty, strangeness, and apparent purposelessness of many of the things 
he is asked to do will not disturb him unduly, since at that age he is 
still doing many things for which he can sec no immediate value. Such 
tasks are accepted by the child as part of his everyday work. Not so 
with the adult. The older he grows, the more he concentrates only on 
those activities which arc cither of practical significance or directly 
pleasurable to him. The reaction of many adults to intelligence tests, 
as contrasted with that of school chiklren, illustrates this difference. 
To most adults, such a test is either foolish or entertaining. The adult 
is far more sensitive to the apparent irnpracticality of the situation 
than the child, who i - accustomed to taking tests which to him may 
seem equally useless. 

That adult ability docs not decline in all tasks is demonstrated by 
the obvious improvement in functions related to the individuaPs daily 
work. The achievements of many people progress along a continuously 
rising line throughout life. 

Nor can a distinction be legitimately made between the extent of a 
person's abilities, which increases conslantly with age, and the level 
or difficulty of task which he is capable of mastering. The latter is 
definitely dependent upon the former. As was brought out in the dis- 
cussion of practice and variability, the more an individual has learned, 
the better able he is to lear. A problem which is commonly regarded 
as diificLilt and which can be solved by only a few individuals is often 
one which involves the synthesis of more numerous and varied types 
of learned behavior. We can say, fo' example, that the derivation of 
a formula which requires a knowledge of arithmetic, algebra, trigo- 
nometry, and calculus is more dilficult than one which can be derived 
simply by the application of principles of arithmetic and algebra. If 
wc define the difficulty of a task objectively in terms of the number of 
people who can perform it correctly it will unquestionably prove to 
be related to tlie number of different ..pccific abilities involved. Even 
if a more subjective, popular definition of difficulty were suggested, it 
would doubtlessly be found to hinge upon the same principle. 

Many of the previously reported findings are clarified if we con- 
sider mental development in terms of learning. Thus the limit of intel- 
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lectual improvement, as measured by common intelligence tests, will 
be reached later by those groups which continue their formal schooling 
to a later age. This has been repeatedly demonstrated in studies on the 
“point of cessalioir' of intellectual growth. The data on the constancy 
of the 10, apart from the purely statistical influence of “overlap,” arc 
also in general conformity with such a “learning hypothesis.” It is 
during the elementary school years that predictions of subsequent per- 
formance can be most accurately made for individual subjects, d'hesc 
arc just the years when American school children, upon whom these 
studies were conducted, undergo the standardized intellectual experi- 
ences provided by uniform curricula. During the preschool years and 
again in adulthood, individuals’ experiences are less standardized, and 
their intellectual performance is less predictable. 

Corroboration of the proposed interpretation of age changes in 
mental traits can also be found in tlie experiments on adult learning. 
It will be recalled that the rate of decline w^as more rapid for the more 
“meaningless” than for the more “meaningful” and useful tasks. A 
similar dilTcrcnce was found between those tasks which were hin- 
dered and those which w^ere aided by the common training furnished 
in our culture. 

Is there any physiologically determined decline in mental activity 
with age, apait from the changes related to learning? The effect of the 
deterioration of requi* ite structures doubtlessly plays a part in the 
marked and sharp decline in all psychological functions which fre- 
quently characterizes .senescence. vSuch obvious handicaps as failing 
vision and hearing, and muscular and neural deterioration can hardly 
fail to affect all the individual's activities. These changes, however, do 
not set in to an appreciable extent until very late in life, and conse- 
quently cannot plausibly be offered as an explanation of the decline 
in mental test performance during earlier maturity, ffdiere arc persons, 
moreover, in whom serious structural handicaps during old age have 
been compensated to a remarkable degree by interest, effort, and the 
advantages of past experience. I'he wide individual differences found 
within any one age level also suggest the importance of specific en- 
vironmental circumstances. 

The physical handicaps of senescence may be regarded in the same 
light as the physical inadequacies of the immature child. Both set the 
upper limits of behavior development at a given chronological period, 
but they do not determine the degree to which such limits will be 
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approximated. It seems, also, that these physically set limits are 
always much higher than is commonly suspected, since training and 
stimulating conditions can at all ages accomplish surprising results. 
Finally, it may be added that age changes in behavior may also vary 
in different cultures (or cultural sub-group^) in which the attitude 
toward old age differs. 
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Family Resemhiance 


Tiul interpretation of family resemblances is complicated by 
the fact that close relatives generally live together. The environment 
of individuals within a single home is certainly more similar than in 
any other situation outside of an experimental set-up. As a result, the 
two classes of factors, hereditary and environmental, operate simul- 
taneously to produce greater likeness within the ordinary family than 
Is found among individuals chosen at random. The closer the heredi- 
tary relationship, moreover, the greater the environmental proximity. 
Thus parents and children, and brothers and sisters, usually live in the 
same home; while more distant relatives, such as uncles and nephews, 
or cousins, come into less frequent contact. Not only arc related indi- 
viduals exposed to common environmental stimulation because of 
similarity of living conditions, but they alsc constitute in part each 
other’s environment and niav become more alike in some respects 
through such mutual interaction. It would seem that family group- 
ings ofler an excellent example of the operation of environmental 
influences in the development of heha^orial similarities. 

Curiously enough, however, family rcscniblances are oiten attrib- 
uted unquestioningly to the operation of heredity. The child is described 
as having his father’s business acumen, his aunt’s musical talent, 
“taking after” his grandfather in obstinacy, and perhaps inheriting 
a keen sense of humor from an Iiish grandmother on his fathers 
side! The successful son of an eminei. "amily attributes his accom- 
plishments to the fact that he is well-born. A lecturer’s vigor and 
zeal are explained by his coming from pioneer stock. A boy’s in- 
genuity with mechanical toys is regarded as only natural when one 
finds that he is descended from a “long line” of boatbuilders and 
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inventors.^ Nor is this type of interpretation limited to popular slip- 
shod thinking and everyday conversation. Many otherwise accurate 
and well-conducted scientific investigations on family resemblances 
commit the same logical fallacy in their interpretations. 

The two major methods employed in the study of family similari- 
ties and differences are family history, or pedigree studies, and cor- 
relation, The former method has been employed chiefly by geneti- 
cists. Genealogies arc traced and detailed pedigree charts drawn up 
for families outstanding either for their deficiencies or for their talents. 
The correlation studies usually deal with the scores of relatives on 
standardized tests. Parents and children, siblings, and twins have been 
compared by this method. The correlation coefllcicnt - furnishes a 
convenient numerical index of the degree of correspondence between 
the scores of any such groups. 

It is of course impossible to determine directly by either of these 
methods what is the relative contribution of hereditary or environ- 
mental factors in producing the obtained similarities. Both methods 
arc at best descriptive and serve only to discover more or less objec- 
tively the degree of familial resemblance present under existing living 
conditions. Only an experimental approach could yield a conflusive 
solution to this problem. If a child of known parentage were isolated 
from its family immediately after birth and brought up under rigidly 
controlled conditions, many of tho' questions on heredity and environ- 
ment might be answered. In such an experiment, it would also be 
necessary to exert some control over prenatal environment, as by 
proper care and diet of the mother. For obvious reasons, such ex- 
periments have not been feasible with human subjects. An approxi- 
mation to this set-up is, however, afforded by the study of foster 
children. The earlier the child is adopted, the more nearly does this 
situation resemble the experimental situation described above. A 
favorable opportunity for the analysis of hereditary and environmental 
factors is also furnished by identical twins who have been reared 
apart from an early age, although the number of such cases is neces- 
sarily small. 

^ A collection of rather amusing excerpts from biographies of eminent persons, 
illustrating the common tendency to look for ancestral origins of the individual’s 
talents and defects, is to be found in Toz/er (37). 

^ Cf. Chapter 2 for a general explanation of correlation coefficients. In tf^e present 
application, the correlation coefficient is used to measure the degree of relationship 
between the scores of i elated individuals on the same test, rather than between the 
scores made by the same individuals on different tests. 
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Because of their more direct bearing upon the heredity-environ- 
ment problem, all studies on twins and on faster children have been 
reserved for a detailed treatment in the next chapter. The present 
chapter will deal exclusively with the more common and general sort 
of family relationships, including parents and children, siblings, and 
more remote relatives or ancestors. 

THE STUDY or FAMILY PEDIGRFES 

The tracing of human family pedigrees with reference to some spe- 
cific and easily identifiable characteristic may reveal valuable data 
on hereditary factors, flie method has proved especially productive 
in the study of simple physical abnormalities, such as albinism, the 
presence of extra fingers, webbed fingers, clubfoot, and a number of 
other rarer and more serious malformations or pathological cond- 
ditions. Certain simple behavior characteristics may also lend them- 
selves to analysis by these methods. The application of pedigree 
analyses to more complex behavior data, however, usually meets with 
well-nigh insurmountable dihiculties. Consequently such use as has 
been made of these methods in the analysis of complex behavior 
data is on the whole open to serious question. Unwarranted inferences 
and ovcrgeneralizalions abound in these studies. 

The identification of hereditary factors from human family his- 
tories involves two major steps: inspection of pedigrees and *'^ene 
frequency analysis'' (cf. 21, 33). First, a number of family pedigrees 
in the charcteristic under observation are assembled. From an exami- 
nation of each of these pedigrees, hypotheses are set up regarding the 
probable hereditary basis of the particular characteristic. As these 
hypotheses are checked against other pedigrees, some can readily be 
discarded, w^hile one may be consistent with all the observed pedi- 
grees and is tentatively accepted. The testing of this tentative hypoth- 
esis in representative samples of the general population constitutes 
the second step, or gene frequency analysis. 

In animal studies, this is the stage at which selective breeding and 
cross-breeding would be carried out as a direct test of the chosen 
hypothesis. Since this is not feasible in human studies, the procedure 
is to compare the frequency of different phenotypes in the general 
population with the frequency expected on the basis of the chosen 
hypothesis. “Phenotypes” refer to the observably different ways in 
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which the characteristic in question is manifested in dilTercnt indi- 
viduals. For example, in the case of a characteristic determined by 
a single pair of dominant-recessive factors, an individual may have 
received two dominant factors from his tw^o parents, or one domi- 
nant and one recessive, or two recessives. Those receiving the 
dominant-recessive combination, however, manifest the dominant 
characteristic. Consequently, only two phenotypes arc found in this 
characteristic, in contrast to the three dilTerent “genotypes” to which 
they correspond. If the frequencies of dominant and recessive genes 
for this characteristic were identical in the general population, then 
the two phenotypes would occur in the well-known Mendelian ratio 
of 3:1. Ordinarily, however, the two genes will not be equally com- 
mon, and the simple 3: 1 ratio will not hold. Nevertheless, under these 
circumstances certain constant relationships between the frequencies 
of different phenotypes will be found.^ It is these relationships that 
arc employed in the gene frequency analyses. Such relationships can 
be derived for various types of hereditary mechanisms, such as domi- 
nant-recessive, blending, and sex-linked characteristics, as well as for 
characteristics depending upon more than one pair of genes. 




ojo 
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Fig. 63. Selected Pedif^rees of I'astc Deficiency in Man. (From Snyder, 
33, p. 416.) 


The use of family pedigree techniques is well illustrated by the 
study of taste deficiency in man. Quite accidentally it w\as discovered 
that some persons report no taste from the crystals of a certain chem- 
ical, phenyl-thio-carbamidc (P.T.C.). To most people, these crystals 
arc very bitter. It was soon suspected that this dilfcrcncc might have 
a genetic basis and investigation of its possible hereditary transmis- 
sion was begun. In Figure 63 are reproduced five family pedigrees 
for this characteristic, selected from several thousand which were 
examined (cf. 4, 33). The first pedigree leaves the way open for 
many different types of hereditary determination, but different hypoth- 

3 All these relationships are derived from the statistics of probability as applied 
to the chance pairing of genes. 
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cses can be eliminated successively as each additional family is con- 
sidered. For example, family No. 3 definitely shows that this taste 
deficiency cannot be attributed to a dominant factor, since the de- 
ficiency appeared in a child both of whose parents were free of it. 
The suggestion that a single recessive may be involved is borne out 
by family No. 5, where both parents show the deficiency. In this 
family, as expected, all the ofispring arc deficient. Olher hypotheses, 
regarding the possibility of sex-linked and sex-infiuenced factors, can 
be ruled out by an inspection of the other two pedigrees. 

Following the tentative acceptance ol the hypothesis of a single 
pair of dominant-recessive genes, a gene frequency analysis was con- 
ducted on a random sample of 800 families. These families included 
some in which both parents were normal tasters, others in which both 
were delicicnt, and still others with one normal and one deficient 
parent. The proportion of tasters and non-lastcrs among the off- 
spring in each of the three types of families, as well as the propor- 
tion of tasters and non-tasters in the general population, constitutes 
the basic data for the gene frequency analysis. If the chosen hypoth- 
esis holds, certain rclaiioiiships are expected among these various 
proportions. In 'fable 13 are shown the obseivcd and expected per 
cents of non-tasters among the offspring of each t}pe of family.^ If 
taste deficiency depends upon a single recessive factor, all offspring 
of two non-taster parents should be non-tasters. I'hai the obtained 
per cent is 97.76 rather than UK), owing to the presence of five 
tasters in this category, need not be regarded as evidence against the 
hypothesis. The in\ csligators (4) suggest a number of possible reasons 
to acccHint for these cxcet-. ons. the Hibjective nature of the taste 
experience may have led to incorrect diagnoses; parentage may have 
been incorrectly determined hecau'. ' of unsuspected adoption or 
illcgillmacy; mutations or unknown fa tors of a hereditary or environ- 
mental nature may have affected the operation of the recessive gene. 
In tlie other two t\pes of families, if will he noted that the observed 
and expected percentages agree closely and thus conhrm the hypoth- 
esis of a single recess ive gene.” 

For an cxplanafuMi ot (he t^oinpiiiation ( '•'•c expected percenla les, ct. 4; .t2: 
and 33, Ch. 2^. 

‘•'the slal.slically trained readei wsll note that the didetcnces in both groups aie 
smallei than then respetljvc siandaid errojs and uic Ihercloie veil within the range 
oi \ariaLion to be expected fiijji saiiii)liin’ criois. Neither o( the difTciences is thus 
statistically si'Mi.licani. (The conce[)t ol sialistically signili'^arit diMcicnce will be 
explained in Ciiaptei 18.) 
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TABLE 13 Gene Frequency Analysis of Taste Deficiency for P.T.C. 


(Jr'rom Cottermaii and Snytler, 4, p. 514) 


Type of 

Number 

of 

Total 

Number of 

Per Cent of Non-Tasting I 

Offspring | 

Family 

Families 

Otfsp/ing 

Observed 

Expected 

Difference 

Both parents 
non-tasters 

86 

223 

97.76 

100.00 

2.24 

One parent taster, 
the other non- 
taster 

289 

761 

36.53 

35.32 

1.21 It 1.76* 

Both parents 






tasters 

425 

1159 

12.28 

12.47 

0.19 It 1.02* 


*'■ Standard error of the dilfeieiKc. 


When either the pedigrees or the observed frequencies are not con- 
sistent with any unit-factor hypothesis, other hypotheses arc set up in 
terms of two or more pairs of factors. For example, wilh two pairs of 
dominant-recessive factors, four phenotypes will be found. Even more 
phenotypes result when there is a lack of dominance in one more 
pairs of factors. Under these conditions, the frequency patterns be- 
come more complicated, but they are still predictable and therefore 
amenable to testing. When the number of hereditary factors involved 
is very large, however, an almost infinite number of quantitative 
gradations is found, rather than distinct phenotypes. In such cases, the 
frequency distribution approaches the normal curve. 

The multiplicity of hereditary factors contributing to most behavior 
functions is one of the obstacles encountered in the application of 
family pedigree methods to human behavior data. If the observed 
frequencies follow the normal curve, little can be deduced beyond 
the operation of a very large number of factors. Moreover, the same 
frequency distribution could result from the combined effect of the 
innumerable environmental influences to which the developing indi- 
vidual is exposed. Such results certainly do not permit the same clear- 
cut interpretation which is possible when simpler genetic ratios are 
involved. 

A second disturbing factor in such analyses is the indisputable 
operation of assortative mating in human marriages. Gene frequency 
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analyses are based upon the assumption of random mating. This 
assumption is probably justified, on the whole, with regard to such 
characteristics as the taste deficiency described above, since most 
individuals arc not even aware of this deficiency in either themselves 
or their associates. Moreover, this deficiency appears not to be cor- 
related with other characteristics which might enter into assortative 
mating, such as general appearance, physique, intellectual level, socio- 
economic level, or national, racial, or geographical background. 
However, most behavior characteristics — and many physical character- 
istics — either play a direct part in assortative mating or enter indi- 
rectly through their association with socio-economic level, geograph- 
ical distribution, and the like. Individuals tend to marry within their 
own groups, economically, nationally, geographically, and intellec- 
tually. Husband-wife correlations in intelligence tests, for example, 
are generally in the neighborhood of .50, and in physical traits they 
cluster around .25 (I6j. In personality characteristics, the correla- 
tions vary widely, as would be expected. In the more purely emotional 
characteristics, such as emotional stability and social dominance, the 
correlations arc relatively low and sometimes negative, averaging 
about .14 (5). On tests of attitudes and values, the correlations range 
from the .20's to the .7()'s and average about .59 (5). To be sure, 
such marital correlations may result in part from the common experi- 
ences and mutual influence of the spouses after marriage. It is doubt- 
ful. however, whether such influences can account for a large part 
of the observed correlations, especially since many ol the subjects 
of these studies had not been married long. Most of the correlation 
can thus be safely attributed to assortauve mating, or the tendency 
lor similar individuals to marry. 

A third ever-present diflicully in thi. genetic analysis of human be- 
havior data is the influence of environmental factor^. Ihc testing of 
genetic hypotheses implies cither a constant influence of environment 
or random environmental variation. In actual fact, however, environ- 
mental dilTerences among individuals are not random, but tend to go 
hand in hand with hereditary dilTerences. Thus the child of physically 
defective or feebleminded parents is more likely to have low 
socio-economic level, poor physical care, and inferior education than 
is the child of intellectually and physically superior parents. 

A further difficulty is presented by the likelihood of inaccurate and 
incorrect diagnosis, especially when information is sought regarding 
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individuals who have been dead for many years. The data collected 
in retrospect on feebleminded ancestors, for example, are often based 
upon reports by untrained persons or upon inadequate records. An- 
other dilliculty, in the reverse direction, is encountered when gath- 
ering information on characteristics which are not manifested until 
late in life. For example, certain psychoses usually develop among 
older persons. Informatiim on these conditions cannot, therefore, he 
obtained while the subjects are still young. Moreover, some indi- 
viduals die before reaching the age when such conditions might have 
developed. 

THE FAMILIES OF EMINENT MEN 

It should be apparent that the mere recurrence of a characteristic 
within a family pedigree proves nothing regarding its hereditary de- 
termination, riic proper genetic study of family pedigrees, as shown 
in the preceding section, involves much more than the simple fact 
of family resemblances. Nevertheless, because most human behavior 
characteristics do not lend themselves to the precise methods of 
analysis outlined above, many widely quoted studies of family pedi- 
gree provide little or no information beyond the greater similarity 
of behavior among related than iimong unrelated persons. This tvpe 
of familiiil investigation was launched b\ the publication, in TSbQ, 
of Sir Friincis (jiillon’s Hereditary Genius. 

Gailon-s approach was distinctly hcreditarian, as illustrated by the 
following summary of tlic aim of his investigation: 'i propose to 
show in this book that a man’s natural abilities are derived by inherit- 
ance, under exactly the same limitations as are the form and physical 
features of the wlude organic world” (10, p. 1 ). Data were collected 
on 907 eminent men in 300 families. In order to facilitate the tracing 
of family histories and the location of descendants and other rela- 
tives, the study was limited to eminent men who were either English 
or w'cll known in England, Ehe information w'as obtained from bio- 
graphical collections or through direct inquiry among relatives and 
acquaintances of the men themselves. Gallon defined as follow's the 
degree of eminence necessary for inclusion in his survey: “When I 
speak of an eminent man, 1 mean one who has achieved a position 
that is attained by only 250 persons in each million of men, or by 
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one person in each 4000’" (10, p. 9). The classes of men in Galton’s 
survey comprised English judges/* statesmen, commanders, literary 
men, scientists, poets, artists (musicians and painters), and Protestant 
divines, the last including men who had achieved fame through some 
phase of religious activity, such as theological scholars, administra- 
tors, religious leaders, martyrs, preachers. 

Within each family, the most eminent man was taken as a point 
of reference, and ail kinships were expressed in relation to him. Fol- 
lowing the name of each of these men, (ialton appended a list of 
famous relatives together with the major held in which each had 
achieved distinction. Whenever more complete information was avail- 
able. these data were presented in the lorm of a family pedigree 
cluiit. As a final summary of his findings. Gallon computed the per- 
centage of eminent men in each degree of kinship to the most eminent 
man of the family, the latter still serving as tiiC point of reference, 
d'hese percentages are given in d'able !4 for each class o\ "eminence” 
separately, as well as for all classes combiiied. It should be noted that 

I’ABIJv 14 Pcr('etUay,e of i,nuiirn( Relatives of Men in luieli C lass 


1 II ji 'I'S ) 












Matin c of Kinship 
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33 

47 

4S 
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20 

32 

28 
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^5 

39 

50 

42 

47 

40 

50 

36 

41 

Son 


49 

31 

51 

60 

45 

89 

40 

4S 

Giamlfather 

15 

28 

1 6 

24 

14 

5 

7 

20 

17 

Uncle 

IS 
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S 

24 

16 
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14 

40 

18 

Ncplicw 

19 

IS 
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. 

23 

50 
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4 

22 
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iO 
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14 

Great-giandCalhcr 

-s 
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4 

.7 
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5 
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20 

18 

16 

0 

1 

8 

13 
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17 

3 

8 

6 

16 

10 

0 

0 

10 

Gieai-giaiidson 

6 

0 

0 

3 

7 

0 

0 

0 

3 

All more remote 

14 

37 

44 


23 

5 

18 

16 

31 


*No female relatives are incliulerl in Ursc summary fi^ies, ulthmigh the names 
ind at Ine VC’ me 1 1 Is ci) siicli rel 4 ’.li'e> aie given m the specilic family liistorKS. 

‘H hc only eategmy limited exclusively to England. 
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the eminent relatives within any class have not necessarily achieved 
distinction in that particular area; thus the famous kinsmen of a 
statesman may include scientists, artists, divines, etc. The classifica- 
tion is based solely on the field of activity of the “most eminent” 
man in the family, around whom the data are organized. 

These figures suggest quite strongly that eminence tends to run 
in families. Not only aiC the percentages much greater than is ex- 
pected by chance and fairly consistent from class to class, but they 
also show a definite decrease in the frequency of eminent relatives 
as the degree of relationship becomes more remote. It is quite a dif- 
ferent matter, however, to conclude that genius is inherited. Gallon, 
to be sure, recognized the dinicultics in the way of such a conclusion 
and attempted a systematic analysis of them. To the question of 
whether reputation is a fair test of ability, he answers in the affirma- 
tive. He argues that reputation or eminence, as the criterion is em- 
ployed in his survey, is “the opinion of contemporaries, revised by 
posterity — the favorable result of a critical analysis of each mans 
character, by many biographers*' (10, p. 33), and hence is not an 
accidental lise to short-lived notoriety. Natural ability he define^ 
quite circularly as “those qualities of intellect and disposition, which 
urge and qualify a man to perform acts that lead to rc]Tutation” 
( 10, p. 33). 

Although admitting the influence of training, surroundings, and 
opportunities, Cialton minimizes the part which they play in the 
attainment of eminence. He constantly hedds up to the reader the 
heroic pictuie of gem us triumphing over obstacles. By definition, 
genius means to him "a nature which, when left to itself, will, urged 
by an inherent stimulus, climb the path that leads to eminence, and 
has sticngih to reach the summit — one which, if hindered or thwarted, 
will fret and strive until the hindrance is overcanue, and it is again 
free to follow its labour-loving instinct” (10, pp. 33-34). He C(m- 
cludcs that “It is alnuwt a contradiction in terms, to doubt that such 
men will geneially become eminent,” and adds that “there is plenty 
of evidence in this volume to show' that few have won high reputa- 
tions without possessing these peculiar gifts" (10, p. 34). This is true 
enough, but it remains to be proved that such “gifts” as the impulse 
to climb, the strength to teach the summit, and the love of labor arc 
themselves independent of environment. Unfortunately, the optimis- 
tic picture painted by Gallon is not borne out by observations of 
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everyday life; and in the absence of empirical proof, it is impossible 
to accept Galton’s interpretations of his findings.' 

DKGENERArr FAMILIES 

The family history method has also been widely employed in the 
effort to analyze the causes of intellectual defect, crime, pauperism, 
and similar conditions. By this method, a number of families have 
been discovered which present an overwhelming array of socially 
inadequate persons over several generations. The same general tech- 
niques arc used in tiacing the history of these families as in the study 
of eminent groups. Living relatives or descendants are visited and 
observed, residents of the vicinity are interviewed, and certifieates of 
marriage and birth and similar public records are examined when- 
ever available. These families are usually found in rural districts in 
many parts of the country, often inhabiting the same crude huts built 
by their ancestors many generations ago. They interbreed extensively, 
are quite prolific, and eventually come to constitute their own com- 
munity, avoided and ridiculed by their neigh[)ors. 

The earliest published pedigree of such a “degenerate’'’ family is 
that of the “Jukes,” described by Dugdale (7) in 1877 and subse- 
quently traced up to 1915 by Estabrook (8). This family first at- 
tracted official notice in the course of a prison survey in New \ ork 
State in 1874. Six persons, all of wJiom were blood relations, were 
found in prison in one county. This finding initiated a thorough search 
for other relatives living in the county and finally led to an extensive 
family history, which cover'd ^cvcn generations and included 540 
individuals related by blood and 169 related by marriage or cohabi- 
tation. The total cost of this family to the state through pauperism, 
crime, vice, disease, and similar conuuions was estimated as one and 
onc-half million dollars within 75 years. 

The original Jukes were five sisters or half-sisters whose progeny, 
legitimate and illegitimate, have been traced for five generations. Two 
of these sisters married two sons of “Max,” a descendant of the early 
Dutch settlers, who lived as a backwoodsman and is described as a 

Other studies on eminent families will be found in Chapter 17 on Genius. 
Gallon's study is here icpoited only as an example of this application of the family 
history method. 

All the names in these histones are, of course, fictitious, but they have become 
well known iri the psychological literature. 
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‘ hunter and fisher, a hard drinker, jolly and companionable, averse 
to steady toil’’ (7, p. 14). lliis man was born in New York State 
between 1720 and 1740. I'he genealogy of the Jukes is usually begun 
with Max, although it is the progeny of the five sisters who have been 
traced and arc shown in the pedigree charts. 

It is interesting to note that, despite the fact that family histories 
arc usually cited as examples of hereditary characteristics, Dugdale 
seemed to be fully cognizant of the influence of environment, as is 
shown by the following conclusion: “From the above considerations 
the logical induction seems to be, that environment is the controlling 
factor in determining careers . . . the permanence of ancestral types 
is only another demonstration of the fixity of the environment within 
limits which necessitate the development of typal characteristics” 
(7, p. 66). And again, in a final summing up of his findings he calls 
attention to the fact that ‘in the ‘Jukes’ it was shown that heredity 
depends upon the permanence of the environment, and that a change 
in the environment may produce an cniirc change in the career, 
which, in the course of greater or less length of time, according to 
varying circumstances, will produce an actual change in the character 
of the individuaf’ (7, p. 113).'* 

i'he '‘Kallikak” family of New Jersey, described by Goddaral (12), 
is particularly interesting since it consists of two branches, one 
normal, the other degenerate. The history of this family has been 
traced to the days of the Ameru'an Revolution. "Martin Kallikak,” 
a 21 -year-old youth of good family, who had joined one of the many 
military companies organized at the time, had sexual relations with 
a feebleminded girl whom he met at a tavern. The illegitimate child 
of this union, referred to as “Martin Kallikak, Jr.,” was the progenitor 
of the degenerate side of the family. Martin, Sr., at the age of 23 
married an intellectually superior woman of his ov/n social level and 
thereby tounded a normal family, many of whose members have 
achieved distinction. A pedigree chart showing the normal and degen- 
erate lines of the Kallikak family is reproduced in Figure 64. The 
forbears of Martin, Sr., arc shown for three generations, and the two 

Winship (40) has contrasted the Jukes with ihe Edwards, a distinguished family 
descending tioni Jonathan fc'dwards, a highly educated and famous theologian of 
eighteenth-ccniury America. 'Ihc comparison rs striking, but not very informative, 
since the two families were entirely independent, of different ancestial stock, and 
living in very ddferent environments. 
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branches which he founded arc traced through the line of the eldest 
son to a member of the present generation. 


Caspar 



Normal Woman. 

Fir. 64. A Pedigree C h.irl l.I Ihe Kaliikak r ci.iiiy (From Cioddard, 12, 

r- 

In evaluating t!ic andings on tlie Kaliikak family, Goutuuu con- 
s-lanlly emphasizes the role ol heredity. Having laid great stress upon 
the I'ael that the two groups were branehes of the same family, 
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furnishing, “as it were, a natural experiment with a normal branch 
with which to compare our defective side,’’ he states that “from this 
comparison, the conclusion is inevitable that all this degeneracy has 
come as the result of the defective mentality and bad blood having 
been brought into the normal family of good blood’’ (12, pp. 68-69). 
It seems rather curious that the common descent of the two branches 
from Martin Kallikak should be regarded as strengthening a heredi- 
tary interpretation of the difierenccs between them. The environments 
of the two groups were not in any way equated by this common an- 
ce::try. In fact, it is evident that the members of the two branches 
were reared under widely differing conditions. A more crucial test 
would have been available if the legitimate offspring of Martin and 
his well-born wife had been exchanged at birth with those of the 
feebleminded woman. It would then have been very illuminating to 
ascertain the relative percentage of feeblemindedness and other de- 
fects in the “normal” and “degenerate” stock. The practical obstacles 
in the way of such a procedure in no way excuse faulty conclusions 
drawn from an inadequately controlled situation. 

Leading geneticists have been critical of the Kallikak study since 
it first appeared. Goddard’s assumption that feeblemindedness is 
transmitted by a single recessive gene seems indeed a gros? over- 
simplification in the light of present knowledge of heredity. More 
specifically, Goddard maintained that Martin Kallikak, Sr., must have 
had a recessive gene for feeblemindedness, which would account for 
the recurrence of feebleminded descendants from his union with the 
feebleminded girl. If this had been the case, however, the complete 
absence of feebleminded offspring in the “good branch” of the family 
would be difficult to explain. Criticism has also been directed against 
much of the data which forms the basis of this family history 
(e.g., 29, 30). For example, the only evidence for the paternity of 
Martin Kallikak, Jr., is based on the original report of the feeble- 
minded tavern girl. The IQ’s of long-dead Kallikaks were often 
estimated on the basis of the reminiscences of elderly neighbors. 
Moreover, in tracing the family pedigrees the rather questionable 
assumption was made that such conditions as pauperism, crime, im- 
morality, and epilepsy arc all manifestations of the same recessive 
gene which produces feeblemindedness. 

Many equally “degenerate” families have been subsequently in- 
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vestigated by psychologists, sociologists, or eugenicists. The research 
staff of the Eugenics Record Office conducted many such surveys 
as one phase of its regular work. Among the groups thus studied 
were the Hill Folk, the Nam family, and the W family of Indiana, 
all presenting the same picture of degeneracy, mental defect, dis- 
ease, and social incompetence through successive generations. Sur- 
veys of eminent families have likewise been sponsored by the Eugenics 
Record Office. Specific lines of achievement, such as scholarly pur- 
suits or boat designing, have been traced from generation to genera- 
tion in the attempt to show that such talents are transmitted through 
heredity. Although offering much interesting material, such studies 
cannot yield any data on the heredity-environment question; the 
opportunities for environmental transmission of such family qualities 
are too obvious to overlook or dismiss.’’ 

Among the most recent applications of tlie family pedigree method, 
greater care has been exercised to insure the accuracy of the original 
data (cf., e.g., 15, 20, 38). Diagnoses of fceblemindness among 
relatives in earlier generations, for example, are examined more crit- 
ically and accepted only v\iien verified by institution records oi com- 
mitment papers, Neveilheksss, the large majority of these studies are 
still subject to the fallacy of regarding mere recurrence of a charac- 
teristic within the family as proof of heredity. Only rarely is any 
attempt made to suggest a specific hypothesis of hereditary transmis- 
sion (cf., e.g., 1), Nothing even remotely resembling the type of 
evidence described in the opening section of the present chapter is 
provided. Most investigator'^ are apparently interested only in showing 
that the condition ‘ runs ^ : families.'’ To regard these studies of 
feebleminded or of eminent families as applications ot the genetic 
methods of pedigree analysis can lead to confusion. 

^''Fiigenics Record Olllce, Cold Spring Tlaiboi, limy Lsland, N. Y. Cf., e.g., 
JcfcrcnCL•^ (6) and O)). 

\ typical c\.inip>c of \hc m’snse of data on family resemblances is to be 
found in an aiticie by R fe and Snyder f2b). Ihnfv-ihree conlempoiary case Ins 
lories of “idiots savanis" in Ameiican insiitMUons ai? descMibed and, on the basis of 
certam familial resemblances, me ollercd as a “rclutation” ol environmental deter- 
mination ol mental development ilic veiy fact that these fccblcMiiirdcd subjects 
e\h bited sivjiie sjrccial talent sccnis also to ' ' "iviided, by a pecubei lotne, as evi- 
dericc lor heiediU Amoivi the cases mled is ti.at ol a low-giaJe ichot who could 
spin objects rapidly with eiihei hand, balanL.mg them on his mde\ fmjtei. Both of his 
parents were vaudeville actois’ 1 hiS case was presented in all seriousness as “evi- 
dence” for the mherilancc of special talents. 
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PARLNT-CHILD RESEMBLANCE 

The use of the correlation technique, although more precise, does 
not eliminate the essential difliculty inherent in all family compari- 
sons, namely, the confusion of hereditary and environmental con- 
tributions. Pearson (23) was among the first to apply correlation 
analysis to parent-child resemblances. Continuing a line of research 
initiated by Galton (11), he collected measures on parents and off- 
spring in physical traits such as stature, arm span, and forearm length. 
The parent-child correlations in these traits averaged about .52. The 
similarity of this correlation to those obtained for bodily characteristics 
of many animal forms led Pearson and others to suggest that this 
figure indicates the contribution of hereditary factors to the develop- 
ment of physical traits. Family resemblance in such traits is probably 
attributable in large part to heredity, although the influence of similar 
environment, especially in the prenatal stage, cannot be overlooked. 

More recently, scores obtained by parents and children on stand- 
ardized psychological tests have been correlated. In the most exten- 
sive of these studies, Conrad and Jones (3) administeied intelligence 
tests to 269 family groups, including 977 persons between the ages 
of 3 and 60. All subjects were native-born, spoke only Knglish at 
home, and lived in rural districts of New Hngland. Socio-economic 
diflercnces within this sampling were small. The younger subjects 
were tested with the Stanford-Binet, the older with the Army Alpha 
intelligence [examination. Tor the entire sampling, the total parent- 
child correlation obtained with these tests was .49. No consistent oi 
significant dillercnce w'as found between molhcr-child and father- 
child correlations, nor did the correlation of sons or daughters with 
their like-se\ parent differ from the correlation with their unlike-sex 
parent. It might be argued that if environment is important in pro- 
ducing these familial resemblances, then children should resemble 
their mother more closely than their father. It is true tliat the mother 
generally has closer contact with the children than does the father, 
but it may also be noted that the father’s intellectual level probably 
determines the socio-econon'iic level of the home more than does that 
of the mother. Conrad and Jones demonstrate statistically that the 
obtained correlation of .49 is consistent with an hereditary interpreta- 
tion of parent-child resemblances in intelligence, after allow^ance is 
made for assortative mating. They recognize, however, that the results 
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are equally consistent with a purely environmental hypothesis, or with 
a combination of hereditary anil environmental inlluenccs. 

In generalizing fnmi the specilic correlation found in this study, 
two further facts should be borne in mind. First, the parent-child 
correlation vanes with the naiiire of the test. For most intelligence 
tests, which are a composite of many tasks of a predominantly verbal 
nature, the correlation of about .50 is probabK typical. The coi rela- 
tion on more homogeneous and simpler tasks will, in gencrai be 
lower. Non-verbal functions, inorco\er, lend to give lower correla- 
tions than the more highly verbal (30). Perlormanee in verbal func- 
tions is probably more dependent upon dillerences in previous 
experience and home background, a fad whidi may account for the 
closer family resemblances on \erbai tests. 

A second factor which alTects familial correlations is the degree 
of Jwniotienehy of home hat ki^ronnd within the group. In the Conrad- 
Jones study, it will be recalled, the sampling was particularly hoino- 
g:eneous. The authors call attention to this fact, pointing out that the 
apparent influence of a common home environmint w'ithin each 
family is minimized when the dilTcrences from home to home are 
slight. Thus the correlations between paients and children might be 
much higher if a wider range ot homes were sampled. 

It should also be noted that the parent-child correlation of approx- 
imately .50 in intelligence test scores is not found until the child is 
about 5 years of age (2(S). flie corrclalion is considerably lower at 
earlier ages and approaches zero in infancy,. It will be recalled that a 
similar lack of correlation was found between the individual’s score 
in infancy and his own latei performance. 7 he two findings probably 
have a similar explanation. A principal factor in such an explanation 
is undoubtedly the difference in I havior functions tcstctl among 
preschool children and among older children or adults. 

Parent-child correlations in personality test scores also tend to be 
positive and significant, altliough running lower than intelligence test 
correlations (5). I'he correlations vary widely with the particular 
aspect of personality under consideration. On the v/hole, the degree 
of parent-child resemblance indicate ' ’w the available data appears 
to be lower for cmofional characteristics, such as introversion, domi- 
nance, or ncuroticism, and Ijigher for attitudes. In fact, the average 
parent-child correlation on most attitude scales is approximately as 
high as on intelligence tests. In connection with the relatively low 
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correlations on tests of emotional characteristics, it is interesting to 
consider the possible effects of parental personality upon the develop- 
ment of the child’s personality. It is likely, for example, that exces- 
sive dominance in a parent may foster the opposite type of reaction 
in the child. The effects of parent-child interaction probably differ 
widely with the degree of the personality characteristic manifested, 
as well as with many other attendant circumstances. 

THE COMPARISON OF SIBLINGS 

The study of siblings, especially when both arc in school, does not 
present the practical difficulties met in testing parents. Consequently, 
investigations on the resemblance of siblings are more plentiful, over 
a dozen studies on adequately large samplings being on record. In 
the previously cited study by Conrad and Jones (3) on familial re- 
semblance, a total of 644 individual siblings in 225 families were 
tested. The correlation was identical with that found for parents and 
children in the same study, viz., .49. That the sibling correlation on 
most intelligence tests is in the neighborhood of .50 has been re- 
peatedly confirmed. The corrclati(m between 384 pairs of siblings 
tested during the standardization of the revised Stanford-Binet wScalc 
( 19) was found to be .53. 'The same correlation (.534) was obtained 
with about 650 pairs of siblings tested in Scotland (27). The latter 
group was especially free from limitations of sampling, since it repre- 
sented all siblings located during a project in which every child born 
in a given community within certain dates W'as tested. 

Imder various conditions, the sibling correlation in mental test 
scores mav drop as low as .30 or rise to nearly .70 (14). Hetero- 
geneity of the samplings tested is undoubtedly a factor in some of 
these differences. Correlations as a whole tend to be higher in more 
heterogeneous groups in w'hich the scores range more w'idely. 
Among college students, who represent a much more homogeneous 
group than the general population, the sibling correlation in intelli- 
gence test scores is closer to .40 than to .50 (34, 35). When siblings 
attending a single school are tested, however, the influence of the 
common school environmefit, together with selective factors, may 
exert the opposite effect upon the sibling correlation. Thus if one 
member of a sibling pair is attending college and the othei is not, 
such a pair would automatically be excluded from the study. But 
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these are the very pairs likely to show the largest differences in test 
performance. Their omission would therefore raise the apparent cor- 
relation between siblings. In a high school sampling, for example, in 
which selection and a common school environment probably had 
more effect than the slight increase in homogeneity, the sibling cor- 
relation on an intelligence test was .60 (36). As in the case of parent- 
child correlations, the nut me oj the test also affects the size of sibling 
correlations, the more verbal t3'pe of tests tending m general to yield 
higher correlations (30). 

Sibling correlations show no consistent trend cither to rise or drop 
with age, when the same intelligence test is used throughout (19). 
It is of course true that the older the subjects, the longer will environ- 
mciual factors have operated upon them. But whethei such factors 
exert a leveling or a differentiating inllucncc upon the development 
of siblings within any one family obviously depends upon v'i'iethcr 
the environments of the siblings have remained similar or diverged 
with age. If. for example, one sibling goes away to boarding school 
at age 10, while the c'thcr remains at home, it would hardly he reason- 
able to expect environment to make them more alike with age just 
because they are members of the same family. 

The amount of age (hsciepancv between siblings also appears to 
have little or no edect upon sibling correlations in intelligence test 
scores (19, 25). For an interpretation of such liiidings, much more 
information is needed regarding the social interaction of siblings with 
each other and whth their parents. A preliminary effort to investigate 
suc’n social factors, cspcciall}’ as they affect the iiiteUigence test per- 
formance of older and younger siblings, is illustrated by an intensive 
follow-up study of 39 pairs of siblings, conducted as a part of the 
Pels Growth Study (17). All the sib.»ng pairs consisted of a first- 
born and a second-born cinld. The children, ranging in age from 30 
months to 12 years, were tested at regular intervals with alternate 
forms of the Stanford-Hinci. With such data, it w-as possible to com- 
pare the performance of first- and sccond-boin siblings in each family 
on tests administered at the same age. I'he two siblings in each pair 
were thus compared on the same te.'>t terns. Significant differences 
in the frequency with which first-born and sccond-born siblings passed 
certain Stanford-Binct items were found. In general, the first-born 
siblings tended to excel on relatively ab^ffract, verbal items, while the 
second-born v/crc superio; on a larger number of iiems, and espe- 
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dally on items involving realistic, concrete tasks. The type of intel- 
lectual stimulation received by the first-born child, who is more likely 
to have adult companionship, is suggested as one possible factor to 
account for these ditferences. 

The comparison of test correlations between like-sey and iinlike- 
sex siblings shows no consistent differences (3 ). One might expect a 
closer resemblance between like-sex siblings because of greater simi- 
larity of experience. The interaction and mutual influence of children 
within the family may be such, however, as to counteract the simi- 
larities in the environments of like-sex siblings. When possible sibling 
rivalries and similar motivational factors arc considered, it is apparent 
that no simple relationship between the development of like-sex and 
unlike-sex siblings can be predicted. 

As is true of parent-child correlations, sibling correlations on 
pctsomility tests arc lower, in general, than on intelligence tests. When 
ratings arc employed, as in a pioneer study by Pearson (22), the 
sibling correlations will be spuriously high because of the rater’s 
tendency to rate two members ('jf the same family alike. On groups 
of 500 or more siblings, Pearson found sibling correlations in the 
.50'’s and .60’s in such traits as “vivacity” and “self-asserUvencss.” 
In contrast to these results with ratings, test scores have yielded 
correlations of about .15 in emotional adjustment, introversion, and 
similar characteristics (5, 24). On attitude scales, the sibling corre- 
lations are higher, clustering betw/een .30 and .40 (5). In their 
extensive study of character traits among school children. May and 
Hartshorne (IS) compared the performance of 734 pairs of siblings. 
'I'he sibling correlations on tests of honesty ranged from .21 to .44; 
in persistence and inhibition, the correlations ranged from ,14 to .46, 
and in service and self-sacrifice, from .05 to .40 (5, 18). 

What are the implications of sibling studies for the problem of 
heredity and environuient? wSome have pointed out that the intelli- 
gence test correlation of approximately .50 found betw^een siblings 
in the general population closely resembles the correlation to be 
expected for a characteristic determined by multiple factor heredity 
(27). Nevertheless, the tact remains that the obtained correlation 
lends itself with equal facility to other interpretations, and no one 
hypothesis can therefore be accepted solely on the basis of such a 
correlation. Attempts liavc also been made to compare the sibling 
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correlations in psychological and in structural characteristics, in an 
effort to disentangle the relative contributions of heredity and envi- 
ronment (22, 36). It has been argued, for example, that since the 
sibling correlation in such traits as height and intelligence is very 
similar, and since hciglit can be little influenced by environment, then 
intelligence must be equally independent of environment. This argu- 
ment begins by assuming that psychological and physical traits are 
influenced to an equal degree by heredity. Any influence of environ- 
ment upon psychological traits would then be superimposed upon 
this common hereditary inllucncc and would be expected to raise the 
correlation for psychological traits. Such an argument obviously begs 
the question. 

In this connection may also be ciynsidercd the implications of 
sibling correlations in animal studies. In an investigation of maze 
learning in white rats (2), for example, a sibling correlation of .31 
was found in the error scoresd- Since all the rats were living under 
fairly uniform conditions, this sibling correlation obviously cannot 
be attributed to environmental dilTerences among the ' rat families,” 
hut rather indicates the influence of hereditary structural factors upon 
maze learning. That such factors do operate in maze learning was, 
of course, indicated in the selective breeding experiments previously 
discussed (cf. C’h. 5). Does this clearly non-environmental sibling 
correlation in the rat experiments suggest that the ^ibling correlations 
in the human studies arc likewise determined principally by hereditary 
factors? Not at all. I hcre is no basis for supposing that the same or 
similar structural factors which operate in a motor learning situation 
in white rats aho operate in dC bi.'havicm sampled by luiman inlelli- 
genee tests. We cannot generalize from one siluatiim to the other, 
any more than we could generalize om studies on sensori-motor 
learning in infants tc’* the learning c>l calculus by college students, 
in our earlier discussion ot maturation and learning (Ch. 5). 

An interesting illustration of the fact tliat similar correlations may 
have very dillerent origins is turnished by an investigation on Louisi- 
ana public school children in grades 5 to 1 1 Ol ). Having located 
203 pairs of siblings in these grades, .’’C investigator paired each 

‘•’Because some of the liilcis classified as independent n.ay have actually been 
half-siblings, the authors suggest (hat iheir group may have been alypically homoge- 
nous and the obtained torrclalion ,ousetiuenUy loo low. It js therefore likely that such 
a sibling correlation should be somewliat highci than .31. 
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child with his own sibling and also paired him with an unrelated 
child of the same age, of similar socio-economic background, and 
attending the same school. The intelligence test scores of these unre- 
lated pairs of children correlated .35, only slightly lower than the 
correlation found between siblings in the same study. Had the home 
backgrounds of the unrelated children been paired off more precisely 
and on the basis of a larger number of characteristics, the correlation 
between their intelligence test scores might have been even higher. 

In conclusion, the study of family resemblances in complex intel- 
lectual and emotional characteristics, whether by correlation or by 
other techniques of pedigree analysis, docs not furnish any unam- 
biguous clues to the origin of such resemblances. The results do sug- 
gest the complexity of factors which operate within the usual family 
milieu. Despite the superficial uniformity of environment, some of 
the interactions among individuals in the family group may make 
for similarity of psychological development, while others may produce 
progressively divergent trends of behavior. T hese considerations have 
prepared the way for an understanding of some of the findings on 
twins and foster children, to be discussed in the following chapter. 
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Twins and 
Poster Children 


Certain special family relationships have been singled out by 
investigators as offering a more direct opportunity to disentangle the 
contributions of heredit.ry and environmental factors. Chief ::mong 
the groups studied for this purpose arc twins and foster children. 
Attention has also centered upon children reared in institntiomil en- 
viroiimerns, such as orphanages. 'I’wins and foster children fall at 
opposhe poles in respect to hereditary similarity. In the case of iden- 
tical twins, heredity is completely alike for the two individuals, since 
they develop from a single fertilized ovum and thus have identical 
sets of genes. At the other extreme, foster children arc reared in a 
family unit with which they have no hereditary connection whatso- 
ever. It follow\s that any difference between identical twins must result 
from the operation of environmental factors. Conversely, siinilarities 
betw^ecn foster children and 'cii foster parents or foster siblings 
suggest the influence of the common home environment. 

The study of fraternal, or non-idcntic' I, twins also provides a prom- 
ising approach to this general problem Such twins arc no more alike 
than ordinary siblings in respect to heredity. They have, however, 
been exposed to a similar pienatal cnviionment, since they developed 
and were born at the same time. Being of identical age, they are also 
exposed to more nearly similar stimuhition throughout childhood than 
are ordinary siblings. They would thus r:cm to offer a sort of "hered- 
itary control” in the analysis of the sibling resemblances which have 
ordinarily been observed. Similarly, identical twins who have lived 
apart from early infancy may be regarded as a “hereditary control” 
for identical twins reared together in the usual way. Within each of 
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these two comparisons, the degree of hereditary resemblance is the 
same. Any differences in the degree of behavioral sim.ilarity can thus 
be traced to environmental factors. 

Children who arc brought up in orphanages or similar institutions 
are in a more uniform environment than those living either with their 
own parents or in foster homes. For this reason, special interest 
attaches to the resemblances and differences which these children 
show among themselves; their resemblance to the parents from whom 
they were separated is likewise of interest. Ai?y relationship between 
the nature of the institutional program and facilities on the one hand, 
and the children’s behavioral development on the other, is also 
relevant. 

THE STUDY OF TWIN RESEMBLANCE ^ 

Beginning with the pioneer study of Gallon in 1875 (cf. 14), twin 
resemblances have served as the nucleus for a number of investiga- 
tions on heredity and environment. The earlier studies on twin re- 
semblance in mental test performance failed to differentiate between 
fraternal and identical twins, thus precluding a clear-cut interpreta- 
tion of their results. All agreed in finding a closer resemblance between 
twins than between siblings, and greater similarity between like-sex 
than between unlike-sex tw ins, 1 he latter must obviously be fraternal, 
since identical twins arc always of like sex. The groups of like-sex 
twins, on the other hand, undoubtedly included some fraternals along 
with the identicals. 

In more recent investigations, the tw^o types of twans are generally 
considered separately, and an increasing use is being made of more 
refined and dependable methods of classification. If the twins arc 
enclosed within a single sac (or chorion) at birth, it is certain that 
they are identical. Such information, how^ever, is not alw^ays avail- 
able. Moreover, two-sac pairs which are derived from a single fer- 
tilized ovum do occasionally occur. This criterion cannot therefore 
be relied upon exclusively as a means of separating the identicals 
from the fraternals. The safest procedure is to compare the twins in 
a fairly large number of physical characteristics. Close similarities 

* For a non-lechnical inirodiiction to many of the biological questions regarding 
twins and othei multiple human births, cf. Newman (46). An excellent critical survey 
of the psychological findings on twins, as well as on foster children and institutional 
groups, can be found m Woodworth (77, 78). 
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might occur by chance in two or three such traits, but if the twins 
are alike in a combination of several characteristics, it is well-nigh 
certain that they are identicals. Among the most dependable criteria 
are similarities in fingerprints, hand- and footprints, color of hair 
and eyes (including the detailed pattern of iris pigmentation), form 
and texture of hair, and shape and arrangement of teeth. Identical 
twins must likewise belong to the same blood group, and since a 
large number of such groups have now been identified, this com- 
parison also provides a fairly good index." 

Typical results obtained when different types of twins arc com- 
pared in intelligence test performance arc illustrated by a study of 
over 375 children with the Stanford-Binet (72), The average dif- 
ference in IQ v/ithin each pair of identical twms, like-sex fratcrnals, 
and unlike-sex fratcrnals is given below, together with the average 
difference among ordinary siblings; 


6.'^ 

ptiijs of idcnticjls 

5 08 

39 

pairs of like-scx fratcrnals 

7.37 

84 

pairs of unlike-sex fiaicrnals 

8 48 

99 

sinliiigs 

13.14 


The degree of twin resemblance may also be expressed in terms 
of the correlation cocfiicient. The two types of data — average dif- 
ference within pairs and correlation between paired scores — arc 
mathematically equivalent and one can be predicted from the othcr.*^ 
1'he reader may find it more convenient to visualize the relationship 
in terms of (>ne or the other compaiison. The correlations between 
intelligence test scores of identical twins are generally in the ,90’s, 
nearly as high as the reliability cociiicients of the tests themselves. 
In other words, the resemblance Hetween identical tw^ins reared in 
the same home is about as close as that between test and retest 
scores of the same individual. The correlations between intelligence 
test scores of frat'Tnal tw'ins fall betw^n those of identical twins and 
those of siblings. Such correlations are more variable from study 
to study than almost any other type » i familial correlation, ranging 
from slightly over .50 (cf., c.g., 23) to about .7() (47, 76). 

This finding is not surprising when w^e realize that the identification 

-It is inlcrcstinK to note that recent stnelie;. in elcclroenccphalopraphy also seem 
to indicate that one-egg twins manifest identical Jbrain wave patterns (cf. 78). 

^^Cf 48. Mean intra-paii difTercnce — 1 lax/l-r, in which r is the correlation be- 
tw'een pairs and o is the standard deviation of the scores within each paired group. 
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and classification of fraternal twins is subject to special selective 
factors which may operate differently in different studies. On the one 
hand, fraternal twins who are quite dissiniiliar in appearance and 
behavior arc more likely to be overlooked in any search for twins. 
Such pairs tend more oUen to be regarded as ordinary siblings in a 
cursory survey of, for example, the children in a particular school. 
This selective factor would lead to an overcstimation of the correla- 
tion between fratcrnals, since the less similar pairs arc omitted. On 
the other hand, when the classification of twin pairs into fratcrnals 
and identicals is somewhat superficial, those fratcrnals wdio are most 
nearly alike in physical and behavioral characteristics are likely to be 
mistaken for identicals, I his will have the effect of reducing the fra- 
ternal correlation, because the more similar pairs arc now eliminated 
from the group. The first of these two selective factors has been 
described by several writers (cf., c.g., 40, 7(S). 4'hal tiic second is 
also likely to operate, especially in studies in which less intensive 
criteria of classification are employed, is suggested by an inspection 
of the data or) average IQ dhfercnccs reproduced above. It will be 
noted that only 39 pairs of like-sex fratcrnals were identified, in con- 
trast to 84 pairs of unlike-scx fratcrnals. Although in gt;ncral the 
number of like-sex and unlike-scx pairs should be roughly the same, 
in this study less than half as many like-sex as unhke-sex fratcrnals 
are listed. None of the unlike-scx pairs could be mislaken for iden- 
ticals, whereas such a confusion could occur with those like-sex 
fraternal pairs who were ck)sely similar. 

Relatively few^ surveys of tw'in resemblance in special aptitudes 
have been conducted. What data arc available suggest that in these 
characteristics, too, identical twins arc much more alike than fra- 
ternals. In both types of twins, however, the resemblance in special 
aptitudes is much less than in tests of general intelligence. On a 
scries of tests of motor skills given to 46 pairs of fraternal and 47 
pairs identical twins, the correlations averaged .43 for fratcrnals 
and .79 for identicals (39 ). On the Minnesota Spatial Relations Test, 
a paper-and-pencil group test of the ability to visualize spatial rela- 
tions, a correlation of .28 was found within 33 pairs of fraternal 
twins, and .69 within 29 pairs of identical twins (4). 

In personality tests, twin correlations tend to be lower than in tests 
of ability. Moreover, in tlie personality area, twin correlations are 
more nearly alike for fraternal and identical twins than they are in 
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the case of intellectual functions. The degree of twin resemblance 
in personality characteristics also varies widely with the specific 
aspect of personality under consideration. All these findings are in 
line with the results reported in the preceding chapter on parental 
and sibling correlations in personality tests. 

On the Bernreuter Neurotic Inventory, correlations of .63 for 
identical and .32 for fraternal twins were obtained (S). Another test 
in the same general area, the Woodworth-Mathews fest of Emotional 
Instability, gave an identical twin correlation of .54 and a fraternal 
twin correlation of .28 (2h). On tests of other personality character- 
istics, such as dominance or self-sulficicncy, the test correlations tend 
to run lower (8). fhe Strong Vocational Interest Test yielded cor- 
relations of only .50 for identical twins and .28 for fraternals (S). 
Although from time to time selected cases of very close resemblance 
in the personalities of twins are reported, equally striking cases of 
dilTercnccs can be found. For example, in a study of ten pairs of 
fraternal and two pairs of identical twins located within a college 
population, tests indicated less agreement betwe^en twins tiian between 
siblings in such characterisiics as self-sufficiency, introversion-extro- 
version, social adjustment, and mascufinity-femininit} (49). Some 
evidence was found in the same sludy that a pair of twins may tend, 
somewhat more often than sil)fings, to develop opposite trends in 
dominance and submission. The implleations of such findings will 
be examined in the following section, in connection with the social 
interaction of twins. 

Mention may also be ma-te of ihe various reports of similarity in 
crifue and in imaniiy ^ among tx'ins, a topic which has proved espe- 
cially alluring to popular writers. In (uie survc> covering 13 criminals 
who were Ivnown to have an identicji twin, 10 cases were found in 
which the other member of the pair also had u criminal record. Out 
of 17 fraternal pairs included in the same survey, only 2 showed 
both twins to have been convicted of crimes (31, p. 46). Attempts 
have likewise been made, in tracing the careers of identical twins, to 
find an ‘^equivalence” between certain forms of illegal and certain 
forms of legal behavior (31, 67).'' i uj behavior of the twins was 
considered equivalent if it appeared to stem from a common “inher- 

Data on the incidence ol ..pecific psyctuyscs amon^ Tv/ins, as well as among sib- 
lings and patents and children, will be pieseuicd in ihc special chapter on abnormality, 
C'hapter 16. 

CT. also 78 for other sources. 
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ent tendency,” though differently expressed. Obviously such inter- 
pretations leave the way open for much subjective bias. It should also 
be noted that common environmental factors, likely to be greater for 
identical than for fraternal twins, could account for much of the 
observed similarity of behavior. Even the knowledge that one is the 
identical twin of a criminal might play an important part in deter- 
mining the individual’s own attitude as well as the reactions of others 
toward him. Finally, as frequently happens in studies of isolated 
cases, other instances can be found to illustrate the opposite con- 
clusion. Cases are on record in which ou^ member of a pair of 
identical twins was either criminal or clearly psychotic, while the 
other gave every indication of remaining normal fcf., e.g., 24). 

THE ENVIRONMENT OF TWINS 

Fraternal versus Identical Twins. All investigators agree in finding 
identical twins more nearly alike than fraternal twins in abilities, as 
well as in most other behavior characteristics which have been studied. 
Identical twins have identical heredity; fraternal twins do not. Can 
we, then, conclude that the greater resemblance of the former is the 
result of heredity? It is not so simple as that. The identicffl twins’ 
closer similarity of heredity is paralleled by a closer similarity of 
environment. This fact has received increasing recognition in recent 
research on twins. On the basis of extensive field study of twins, 
Carter (8) argues against the assumption that nurture inlluences are 
even approximately the same for identical as for fraternal twins. 
He wTiles: 

Such an assumption seems untenable to anyone wdio has had much 
contact with twins in their own social environment, for it is quite evident 
that the enviionments of identical twins are on the average more similar 
than those of fraternal Iwans. The identical twins obviously like each other 
better; they obviously have the same triends more often; they obviously 
spend more time together; and they are obviously treated by their friends, 
parents, teachers, and acquaintances as if they were more alike than fra- 
ternal twins arc (8, p. 246). 

Many other investigators lend support to such a conclusion. It is 
clear that fraternal twins are often quite unlike in body build, general 
health, eye and hair color, muscular strength, and many other physical 
characteristics (70). One twin may be ugly and the other handsome; 
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one sickly and the other hale and vigorous. The effect which these physi- 
cal differences will in turn have upon the twins’ relations to their envi- 
ronment may be very far-rcaching (25, 78). Each twin will, by virtue 
of his physical characteristics, automatically ‘"select” different features 
from the same environment. Actual observation has repeatedly shown 
that the amount of shared experience of fraternal twins is less than 
that of identical twins. For example, in a questionnaire (75) an- 
swered by 70 pairs of identical twins, 69 pairs of like-sex fraternals, 
and 55 pairs of unlike-sex fraternals, 43% of the identicals reported 
that they had never been separated for more than one day. Among 
the like-sex fraternals, only 26% reported this to be true. Identical twins 
more often share the same room at home, have the same chum, and 
are treated more similarly by their families and associates (30). In 
fact, it is not uncommon for one twin to be mistaken for the other, 
especially in childhood. All this furnishes an interesting illustration 
of the indirect influence which physical similarities may exert upon 
behavior. These similarities, which are themselves largely determined 
by hereditary factors, may in turn alter the individuals environment 
in such a way as to affect his behavior development. 

A word may be added in this connection regarding comparisons 
between like-sex and unlike-sex fraternals, as well as between fra- 
ternal tw'ins and siblings. I'hc greater similarity in test performance 
generally found for like-sex than for unlike-sex fraternals could result 
from cither hereditary or environmental factors. On the side of 
heredity, it will be recalled (Ch. 4) that the presence of sex-linked, 
sex-influenced, and sex-limited factors may introduce a number of 
hereditary differences between unlike-scx children the same par- 
ents, which arc not present in like-sex children. On the side of envi- 
ronment, it is apparent that the effccuve environments of a boy and 
a girl are more dissimilar than would t>e the case for two boys or two 
girls. Thus the differences in the results obtained with like-sex and 
unlike-scx fraternals do not lend themselves to unambiguous inter- 
pretation. Any differences in degree of resemblance between fraternal 
twins as a group and siblings as a group, however, can logically be 
attributed to the greater environmental s imilarities of the twins. On 
the basis of heredity, fraternal twans should be no more alike than 
ordinary siblings. But their environments will tend to be more similar. 

'^Except in so far as the selective factors discussed in the preceding section may 
have operated. 
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This is obviously true of prenatal and natal conditions. Moreover, 
being of the same age, the twins will be exposed to any changing 
influences in the home environment at the same stage of their develop- 
ment. The attitudes of parents and associates toward the children, 
as well as the attitude of the children toward each other, arc also 
likely to differ in the two situations. 

Prenatal and Natal Factors, it is obvious that any differences 
noted within a pair of identical twins must be the result of environ- 
mental factors. When identical twins reared in the same home show 
conspicuous dissimilarities in development, tiie possible role of pre- 
natal factors or of birth injuries is suggested. That prenatal conditions 
may produce delicicncies in one twin while the other develops nor- 
mally is quite consistent with what i.. known regarding the embry- 
ology of twinning. During prenatal life, the twins are compclilois lor 
the available supply of nourishment- Sometimes one twin loses out 
completely and lalls to survive, while the other develops at his ex- 
pense. When the inequality is milder, both are born, but one may be 
weaker than the other. 

An example of the possible operation of prenatal and natal factors 
in producing differenees helwcen identical twins is to be l^iund in 
the occurrence of Icehicinindcdness. In a survey of several feeble- 
minded institutions, Rosanoff ct al. (51 ) located 126 persons known 
to have an identical twin. In tffc nKijorify of these cases, the other 
twin was also feeble-minded or showed some other abnormal con- 
dition such as epilepsy, binh paralysis, or behavior difficiiities. In 1 1 
pairs, however, no delect was Icnmd in the other twin. Since the 
abnormal condition in the defective twin in these pain^ appeared 
early in iile. the prohahihty of birth irtjuries or prenatal factors is 
strongly suggested. Some in\ esiigators {cf., o.g., 51 ) con ider cciebral 
birth injuries to be a relatively common, unsuspected cause of men- 
tal defieieney. An injury too mild to attract notice at tlie time may 
nevertheless be sLiffieieiu to interfere with normal intellectual develop- 
ment later on. This point of view has been most vigorously cham- 
pioned by Rosanoff (51 ). Since twnns tend on the wditde to be born 
prernaUirelv— when thev are relativelv small and weak — they aie 
especially suojeet ic I'irih injuries. Rosanoff (51) estimates that con- 
ditions favoring birth injuries arc about eight times as frequent in 
the birth records of feebleminded persons as in the general popiiLuion. 

The prenatal and natal conditions surrounding the development 
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of twins have also been cited in explanation of the finding that twins 
as a whole tend to be inrcliccinally inferior to sinyle-born children 
(8). Such retardation has been noted troin the preschool level 
(cf., e.g., 10) through high school. In a survey of 412 pairs of twins 
located in a high school sample of 11 9,851^ students, the average 
percentile score on the Menmon Nelson Test of Mental Ability was 
39.73 for the twins and 50 for the rest of the group" (7). Further 
corroboration is furnished by the results of intensive lollow-up studies 
of triplets and quadruplets, as well as by the detailed observations of 
the widely publicized Dionne quintuplets (2, 3, 8, 10). It can be 
argued that the larger the number of individuals competing for stir- 
viva! in the uterine environment, the more severe the handicaps im- 
posed upon all of them. I'hc observed facts regarding the intellectual 
development of multiple-birth children appeal to lend some support 
to such an hypothesis. The fact that multiple-birth children arc often 
born prematurely ctniid also account in part for Iheii retaidation, 
since they are actually at an earlier stage of development than their age 
indicates. It is doubtful, however, wiiether this factor has a significant 
ctfcct upon intellectual development in later childhood; its inlluencc 
is probably limited in large part to early sensory and motor de- 
velopment. 

The Language Development of Tv^ins. A note of caution must 
be sounded against the too facile acceptance of these structural ex- 
planations of intcllectuai retardation among twins. It should be noted 
that the observed retardation is generally most marked in the acquisi- 
tion of language. This in turn has an important bearing upon other 
forms of subsequent intellectual development. Ibe backwardness in 
language may result at least in part from the presence of two (or 
more) children of identical age in t;.e same family. It is a common 
observation that twins frequently form a relatively self-sufficient unit, 
and consequently have Jess need for contact and communication with 
other children and adults. It is just these contacts, however, which 
provide powerful incentives and opportunities for learning to talk. 
That twins may be physically retarded or handicapped because of 
tne physiological conditions of twin d 'vclopmcnt seems quite appar- 
ent. But that the same conditions provide a sufficient explanation 
of their intellectual retardation is not conclusively established. 

Difference/P If. ^7; the slalisiiCtti sienificance of this difference is there- 
fore very high. 
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Let us examine some of the specific data on language development 
among twins. Figure 65 shows the development of the Dionne quin- 
tuplets in (a) total intelligence test scores, (b) motor functions, and 
(c) the acquisition of language. It is apparent that the greatest re- 
tardation is in language and the least in motor functions. The indices 
of “general mental development” occupy an intermediate position, 
probably because of their composite nature. Althougli the quintuplets 
were born two months before the normal term, and although the 
possibility of fetal handicaps exists, it is unlikely that such conditions 
would produce a more marked retardation in language than in motor 
development. In his discussion of the linguistic retardation of the 
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Fig. 66. Linguistic Development of Twins and Singletons. (From Dyvis, 
10, p. 136.) 

quintuplets, Blatz (2) calls attention to a number of likely environ- 
mental factors. Since most of their Wu Us were anticipated by ever 
vigilant attendants, the children had little need to communicate with 
adults. They had little to tell each other, since they shared most 
experiences. By age three, moreover, they had developed a number 
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of mutually intelligible gestures and cries to express their feelings 
among themselves. 

(jroup surveys of triplets (27) and twins (10, II) have yielded 
similar results. Special systems of communication, through gestures 
and vocal cues, are frequently developed by twins out of their com- 
mon experiences. The need for acquiring the language of adults is 

thus reduced. Specific indices of 
language dcvelopinenl, such as 
length of response or number of 
different woids used during a 
standard observation period, show 
consistently more rctaroation than 
is found in the total 10 of such 
children. The extent of this lin- 
guistic retardation is illustrated in 
Figure 66. Bringing together the 
diita of several investigators, this 
figure shows the average niirnber 
of dilTeient words used during 
the examination period by t^vins 
and singletons (singly born chil- 
dren) between the ages of V/i 
and 9^ 2 . The dilferenee between 
the two groups appears to be 
largest from ages 3 to 5. and decreases somewhat during school 
ages.^ 

That the contact of twins with each other is a major factor in their 
linguistic retardation is further suggested by the iinding that only- 
childtcn arc definitely supcrio-r to clii!dren-with-s;hiiiigs in every 
phase of linguistic skill (10), In fact, singletons-wilh-sibiings re- 
semble twins in many phases of their language development more 
closely than they resemble orily-ehiidren. Jn Figuic 67 will be found 
the number of dilTerent words used during a test period by twins, 
singletons-with-siblings, and only-children at ages 5f'2, 6 V 2 , and 
9 V 2 . It wall be noted that the singletons-with-siblings arc somewhat 
closer to the twins than to the only-children at the first two age levels, 
and about midway at the third. Also relevant is the finding that 

The very small tlilTercncc at age 6 ' 2 may be due to the relatively small number 
of cases examined at this age. 
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Fig. 67. Fomparisou of the T an- 
guage Development <.>1 Twins, Sin- 
gletons walh Siblings, and Only 
Children. (From Davis, 10, p. J 12). 
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children who spend more time with adults tend in general to be 
linguistically superior (38). 

In conclusion, the ret,irding eftcct of the “twin en\ ironmcnt"' upon 
language development seems to be quite clearly demonstrated Lin- 
guistic retardation m turn has lar-rcaching implications lor all intel- 
lectual development Not only is language ncccssaiy as a means of 
communication in most human leiimng but linguistic symbols them- 
selves pla> an impoitant p irt in pioblem solxin*^ and m the more 
abstract and complex human intclleetual limUions 

Social Interaction. I he social reactions ot twins toward each 
other provide a promising iielj ol investigation in themselves Many 
obscrccrs have called ciltintion to the \juuLilrati()n of "toUs ’ which 
twins oltcn seem to work out b> a tacit mutu.il agreement (47, 78). 
Such a division ot laboi — oh erved espc»,nll> among identical twins — 
makes tor more lutrmop ous rckitionship^ and economy of i ^Tort 
Thus one twin may be the spokesman for the pair m encounters with 
other persons, showing more interest in people and responding more 
actively to tiKm rrequeiil))/ one twin is the dominant number of 
the pair, tending more olten to Iciie^ and to make the decisions lo** 
both (11, 78) Sueh a diflereiiti ition ot roles may onginally aiisc 
from slight ddleieiues m si/e and stienglh, wln^h may hive been 
prcnatally established J he paients’ eflorts to discovei and empha 
S17C any distinguishing mirk between the twins may be a luithcr 
so irce ol dilTerLntiatkm In some case*-, minor chance happenings 
may initiate the ditlercnce, whieli is Ih^n willmgly accepted and de- 
VL loped b} the twins a in it< ol convenience. 

Such a divison of rok s, eomuu’ed and augmented ever the years, 
coilJ aecoLint tor some ol the dilleienccs in interests, attitudes, emo- 
tional re letions, and abihtu > sonieliru*- found between twins reared 
in the same home I oi cx miplc, the rclaliv'clv large deferences be- 
tween twins in such ti nls as dominance, sclf-sulhcicncy, introvcision, 
and the like, found m the preMou h cited stud> on a college popula- 
tion, arc understandable on thi^^ basis (49) I'lese hndings arc sup- 
ported by more detailed case' studies of n iividual twins (el , c g 42) 
and are borne out in very interesting ays by the obscrvatlon^ of 
larger mulliplc-birth groups The Dionne ejuintuplcls, tor example, 
although reared undci as nearly uniform and controlled conditions 
as any group ot children, nevertheless show clcar-cul personality and 
ability dillerences (2, 3). Yet the conclusive demonstration ot their 
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identical heredity precludes any explanation of such difTerences in 
terms of heredity. In commenting upon these findings, Blatz writes: 

It is in the environment, apparently the same for all, that there never- 
theless exist subtle yet important differences in the influences bearing on 
these children — dilfcrcnccs of which the social interaction of the five, one 
upon the other, is the most emphatic yet the most difficult to identify and 
measure (2, p. 174). 

Another vivid demonstration of environmentally determined dif- 
ferences amerng identical twins is furnished b/ the Morlok quadru- 
plets of Lansing, Michigan (16), On the Stanford-Binet Intelligence 
vSeale, one of these quadruplets received an IQ of 110, another 101, 
while the other two occupied intermediate positions. This relation- 
ship was consistently maintained on other tests of intelligence or 
scholastic ability. On the Stanford Achievement Test, ior example, 
the “bright” twin earned a total score of 124 and the “dufl” one 96. 
Each of the sub-tests of the Stanford Achievement lest showed the 
same relationship. Physical and personality differences among the 
quadruplets closely paralleled these intellectual diiferences. The dull- 
est twin was also the smallest and had had a poorer health histc'^ry 
throughout childhood. The investigators suggest the possibility (Tf dif- 
ferences in fetal blood supply as a basis ior both the physical and 
the psychological dissimilarities. It may well be that the initial, pre- 
natally determined, physical diiferences led to a subsecjuent social 
diversification of roles among the four sisters, which in turn affected 
their subsequent emotional and intellectual development. The per- 
sonalily diiferences among the four arc reported to be especially 
conspicuous. The children have been characterized by their parents 
as “the boss,” “the clown,” '4he artist,” and ''the baby,” and the 
investigators report that an oulside observer could readily identify 
the child fitting each of these labels, even from a brief observation. 

TWINS RI'ARf D APART 

Of considerable interest are the case studies of identical twins who 
were separated at an early age because of death of parents or other 

•'Those :iT 0 the oaH knov.^n lixini: one-c‘j:g quadniplets. A seiies of studies of 
other qLKidnipleis li.ive been lepoueJ by 1. C Cmitlner uiiel If H. Newman in the 
Journal (ff Ilcfcdify, Jl, 307 314,'*4iy 434, 1942, 33, 311 314, 345 350; 1943, 

34, 27 '^2. 
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misfortunes, and were reared in separate homes. About twenty-five 
such pairs have been located and carefully studied. The most exten- 
sive collection of cases has been assembled in an investigation at the 
University of Chicago, conducted by Newman, Freeman, and Holz- 
inger, a geneticist, psychologist, and staiislician, respectively. The 
principal study (47) covered 19 pairs of identical twins, most of 
whom had been separated since their first year of life. The actual age 
of separation in individual cases ranged from two weeks to six years. 
All the twins had lived apart up to the time of their examination, 
although in one or two cases they had corresponded or occasionally 
visited each other. The ages at the time of testing ranged from 11 
to 59 years. 

Fach case was intensively studied through physical measures, 
psychological and educational tests, and persona! inlerview's. Data 
were also obtained regarding the foster home and foster parents, 
educational and vocational history of the twins, health and disease 
records, and other relevant factors in the subjects' experiential back- 
ground. A case history illustrating the elfecis ol two fairly dissimilar 
homes upon a pair of identical twins is summarized below. 

Case #4. Mabel and Marv, 29-year-old twins, had been separated at 
the auc oH five months and reared by relatives. Mabel had led the life of 
an aetivc farm woman on a prosperous farm. Mary had hved largely a 
sedentary lile in a siiia!! town, clerking in a store during the day and 
teaching music at night. Mabel had only an elementary school education 
in a rural school, while Mary had had a complete high school course in 
an escellenl city school. At the '-rnc o! examination, a vast dilTerence was 
noted bctw'cen the twins in inlelleUiial, emotional, and physical traits. 
Physically, Mabel is described as lobiist, muscular, and in perfect health, 
while Mary was underweight, soft muscle^, and in poor general condition; 
Mabel weighed LTS'A lbs., Mary only 1 10'-; lbs. Inicllcctually, an equally 
sti iking ditfercnce was found, but in favor of Mary, w'hose StanfordTiinet 
IQ w\as 106 as compared with S9 for her sister. Fven larger ditfercncc.s 
were v')btaincd in some of the other tests. In personality characteristics, 
the twins exhibited consistent ditlcrences, as determined both by tests and 
by direct observations. The rural twin tended to be more stolid and stable 
in emotional responses, to give fewer neurotic reactions, worry about 
fewer things, and respond less emotionally to stimuli than did the urban- 
bred twin. Both the physical diflerences noted above and the contrast 
between their psychological environments probably account for these per- 
sonality diflcrcnces between the twins (47^ pp. 187-195;. 
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TABLE 15 A Comparison of Identical Twins Reared Apart with 
Fraternal Twins and with Identical Twins Reared Together 


(Adapttrl from Atwman, I'rconian, and IJoiziTipcr, 47, jip. 72 , ^ 7 , J44, 347. and 
Woodwoilb, 7fi, p. 



Mean Difjcfcncc between 

'[ w'lns 

Measure 

Fiatenval 
(50 pair^) 

Ulcnucal 

T o gel he I 

( ’4) pLlUS) 

Identical 

Sepal aled 
(19 pairs) 

Height in cm 

4.4 

1.7 

1.8 

W'cighl in lb. 

iO.O 

4.! 

9 9 

Bind IQ 

9 9 

5 9 

8.2 


Cot U’kt 

u fi.s /}' ’tween 

f \\ ins 

Measure 

Fratci nal 
(.50 p<nis) 

Identical 

Toueihc! 

( 50 puns) 

Identical 
SepuMted 
( ! 9 jiairs ) 

Height in cm. 

(A 

.9^ 

.97 

Weight m Ih, 


.92 

,89 

Bind IQ 

.63 

.88 

.“^7 


* C (‘rfain y ‘,iatisiical conct iiims, for for incqualilic's iri IQ /ancc, 

have been applieil lo ihc ongm.d coi iclalions U)Vcn h} Ncwmaji. I itvnitffi, aiul 
llol/ingei (47). I'oi iuiliici dojails oa these (.(MjCcIious, see 2o, 40, aiul 16. 


To obtain comparaiive data, Newman, Freeman, and ITolzingcr 
tested 50 pairs of UlcnTual iwbis liviiig together and 50 pairs of 
fraternal twitis, also Ii\ing together and with their own lamilies. 
Mean differenees as well as correlations in height, weighu and IQ 
for all three groups arc shown in 'Fable 15. It will be noted that iIjc 
average IQ dilTercncc between the separated identicals is 8,2, slightly 
less than the mean difference between fraiernals, but slightly larger 
than that between non-separated identicals. Fissenliaily the same rela- 
tionship is brought out by the correlation coetlicienls, the separated 
identicals falling between the non-separated idemicals and the fra- 
tcrnals in closeness of resemblance. 

Several A\riters have tried lo draw inferences regarding heredity 
and environment from a comparison of the degree of resemblance 
in physical and intellectual traits in these three groups. In d abic 15, 
the results for height, weight, and IQ are not startlingly different. To 
be sure, tlie height and weight correlations for the two groups of 
identical twins arc higher and more nearly alike than are the cor- 
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responding intelligence test correlations. But we must bear in mind 
that measures of height and weight are more reliable than measures 
of intelligence. Chance errors of measurement in inteliigcnce tests 
would introduce more lliictuations in score, even in retests of the 
same individual. Such chance iluetuations lend to lower the correla- 
tions oF lO’s and render any dilTcrenees between such correlations 
less significant. In other words, we cannot conclude with certainty 
from the data in I’able \5 that the clTect of separation was any 
greater on IQ than it \\as on height and weight. At the same time, it 
should be remembered that such comparisons between familial re- 
semblances in physical and in psychological characteristics, although 
of S(une interest in ihcmsclves, really tell us little about the problem 
ot heredit}^ and environment. Similar cr^rrclations can result from 
diflerent factors, and their similarity is therefore no indication that 
the same influences iia\c operated. 

A more crucial approach is provided by an analysis based upon 
the extent of environmental differemes between the tw^(’) tw ins in each 
separated pair. The mere separation of the twins need not in itself 
lead to ditTerences. It is eonceixable, as a matter of Fact, that a par- 
ticular pair oF tw^ins reared apart may be more alike than if they 
had been reared together. If their respective cinironments arc closely 
similar — although geographically remote — identical twins should re- 
spond with considerable uniFormity. Their physical likenesses, based 
upon a common hcreditv, w'ould in such a case insure like responses 
to li]:e stimulation. Brought up together, on the other hand, the same 
two twins might show' diverge ^ ct^urses ol development owing to the 
specialization oF ‘Toles” discussed in the preceding section. Psycho- 
logically, environment is not geography . 

In Table 16 will be found individua! data on each pair of separated 
identical twins. The original 19 cases of the Chicago study have been 
augmented with a twentieth case subsequently reported by Gardner 
and Newman (15). The JQ dilTcrenees in the last column indicate the 
excess in Favor oF whichever twin received tht better education. An 
examination of these 10 differences suge<'sts that, on the whole, they 
arc not random djlTcrences such as might result from fortuitous fac- 
tors, but rather tend to favor the better educated tvvin quite con- 
sistently. If we restrict our comparisons to the five pairs which present 
large differences in amount of schooling (first five cases in Table 16), 
the mean JQ difference in favor of the better educated twdn is 16 
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* The first 19 cases are troni Tstwman. Fiecnia.i. and IIolzinLic. (4"); C ast.- 20 later added to the collection by Gardner and New- 
man (15). Five iiddit’onal ca'.cs, studied by other inve'.iigatois. aic cited in the text Thc> ha\e not been included in the table since the 
data do not lend themselves to compaiable evaluations in teims of iIk above categoiics. 
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points. It will be noted that in the remaining cases the differences 
in schooling are small or non-existent. In so far as schooling may 
affect the IQ, then, these remaining twins would not be expected to 
differ much. And the differences are, in fact, small. If the cases 
with similar educational opportunities, where little or no difference 
is expected, are averaged with those showing clear-cut educational 
differences, then the possible effects of this environmental factor are 
diluted and underestimated. A composite figure based iipcm all these 
cases, in which specific conditions varied so widely, would only 
obfuscate the results. 

On the basis of the case material, the environments of the sepa- 
rated twins in the Chicago study w'crc r iicd by five judges for degree 
of intra-pair difference in educational, social, and physical or “health” 
advantages, respectively. These ratings arc also given in Table 16. 
Since each of the judge* used a l()-p(fmt scale, their combined rat- 
ings for each characteristic could have a maKimum value of 50. The 
higher the rating, the greater the estimated difference in environ- 
mental advantages between the two twins in each pair. These ratings 
show interesting correspondences with the observed dilferences in 
intellectual, cnKHional, and physical characteristics of the twins. Thus 
a correlation of .76 was found between the discrepancies in educa- 
tional advantages and the discrepancies in IQ within each pair of 
twins. IQ dilTerences correlated .5 I and .30, respectively, with judged 
differences in “social” and in “physical” environments.’’ Tw^in dis- 
crepancies in body weight, on the other hand, correlated .60 with 
discrepancies in the “physicaF' en\ irorinienls. 

It should be noted, moreover, that the types of homes into which 
the twins in any one pair w-ere placrJ rarely differed very much. 
If an experiment were being designed to lest how far cn^'ironment 
may affect, for example, the 10, the twins would obviously be placed 
in as different homes as possible. But in the actual placement pro- 
cedures followed, the reverse tendency probably operates. The place- 
ment of children in foster homes tends to be selective, an effort being 
made to place the children with famifv'., similar to their own. In a 
number of twin pairs, the children W'crc adopted by relatives, ff'his 

^^Two cases, 1 and 8, shov^ evidence in their histories of a possible prcnauil 
handicap, wh.ch tended to make the Iviins unlike. It these two cases are omitted, the 
remaining cases sliow an insignilicant mean dilTcicnce of less than 2 IQ points. 

The coi relation of ..10 is not statistically significant; the other jeported correla- 
tions meet the usual standard of significance. 
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would certainly make for greater similarity in socio-economic, educa- 
tional, and other characteristics of the two foster homes than would 
be the case between two ^'amilics picked at random. 

Five additional pairs of separated identical twins, studied by other 
investigators, tend to corroborate the major findings of the Chicago 
survey. A case reported by Muller (44) in 1925. the earliest on 
record, showed a negligible diliercnce between the twins on two 
group tests of intelligence. Although one twin had had only 4 years 
of formal schooling and the other 13, the educational levels of the 
foster parents and the socio-economic levels of the tw^o luunes were 
closely similar. Both girls are said to have read “voraciously,’' a fact 
which may have helped to counteract the differences in schooling. 

More recently, two pairs of separated identical twins have been 
studied in this country (6, 6S), and two pairs in Great Britain 
(52, 79). In none of these four pairs w'as there any dilfcrencc in 
amount of schooling received. Other differences in opportunities for 
intellectual development, as suggested by the descriptions of the 
homes, foster parents, or type of education, appear to have been 
cither minor or counterbalanced within the pair. For example, if one 
member of the pair w<is handicapped by frequent changes of ^'bool- 
ing, she had the advantage of higher socio-economic level of the 
foster home, in comparison with her tv\in. In three of these four 
cases (6, 52, 79), diii'crcnccs in intelligence test scores vvere uni- 
formly small and insignificant, although a number of differences in 
attitudes, social conformity, ;ind other personality characteristics 
were noted. 

All interesting divergence in special aptitudes is presented by the 
loiirth recently reported case, that of a pair of British twins sepa- 
rated at 3 months and reared apart until the age of 16 years (79). 
Although both had received the same amount of schooling, one twin 
had a Stanford-Bincl 10 of 125, the other of 106, The twin with 
the lower Stanford-Binet 10, however, excelled consistently in per- 
formance tests and in tests of mechanical aptitude; these diflerenccs 
amounted to as much as two years in mental age and nearly 30 per- 
centile points, respectively, in the tw^o types of tests. Moreover, the 
twin with the higher Stanford-Binet 10 (and lowxT mechanical and 
performance test scores) was inferior to his co-twin in height, weight, 
and general health. The possible effect of prenatal and postnatal 
environmental factors upon physique and health, which in turn might 
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influence the divergent development of interests and aptitudes, is 
suggested by these results. 

To the case studies of identical twins reared apart may be added 
the educational experiment reported by Schmidt (53 ). In this investi- 
gation — the major portion of which was discussed in Chapter 8 — 

9 pairs of identical twins were included within the total group. It will 
be recalled that in this study a number of children originally classi- 
fied as feebleminded were able to make a relatively normal educa- 
tional, intellectual, and vocational adjustment, as a result of a specially 
designed three-year educational program. Among the 9 pairs of iden- 
tical twins, one member of each pair participated in the special 
program, while the other remained either in regular public school 
classes or in the usual ungraded classes provided for backward 
children in the public education system. 

The average IQ of the twins in the experimental group rose from 
54 to 92 in the course of ttic special program; the control twins 
showed a negligible change from 61 to 59 during the same period. 
Jt will be noted that, initially, the average 10 of the control twins is 
higher. This initial dilference results from the faci that when the two 
twins in any pair dillered appreciably in 10, the one with the lower 

10 was chosen for the experimental program. Because ol this pro- 
cedure, it is unlikely that the final advantage in 10 in favor of the 
experimentally trained twins could result from possible prcnatally 
determined structural advantages. On the other hand, the regression 
cfl'cct (ef. Ch. 8 ) vvould account for a slight tendency for the initially 
higher control group to lo^e nd for the initially lower experimental 
group to gain. Regression alone could hardly account for a large part 
of the observed rise in 10 in the experimental group, however, espe- 
cially in view of the high reliability of the Stanford-Binct during the 
elementary school ages covered by the study. Emotional readjust- 
ment may have played a significant part in the improvement in intel- 
lectual, educational, and social performance of the experimental 
subjects, since such readjustments were an integral part of the pro- 
gram. Individual remedial instruction, better work habits, and im- 
proved attitudes and motivation all undoubtedly contributed to the 
gains in IQ in individual cases. 

Methodologically, the twin analysis in Schmidt’s investigation falls 
midw^ay between the case studies of twins reared apart and the train- 
ing experiments by the method of co-twin control cited in Chapter 6. 
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It is broader in scope, of longer duration, and concerned with more 
complex behavior functions than the previously cited co-twin-control 
studies. At the same time, the separation was not so complete nor did 
it begin so early as in the case studies reported in the present chapter. 
During the threc-ycar period of the experimental training, however, 
the twins in the Schmidt study were probably exposed to more dis- 
similar stimulation than was true of most of the accidentally separated 
pairs studied by others. 7 here is undoubtedly need for more experi- 
mentally controlled co-twin studies of the development of complex 
behavior junctions. Such an approach should prove to be the most 
promising of all those employed in the study of twins. 

FOSXrR C niLDRFN 

How Well Do Foster Children ^‘Turn Oiir'? The development of 
children reared in foster homes is of considerable interest for practical 
as well as theoretical reasons. Is there any basis for the popular belief 
that adopted children “tuin out badly’’? On the whole the answer 
appears to be “No,” although the contributing factors are too many 
and too complex to permit a categorical denial. Follow'-ups of a^roup 
of 910 adopted children indicated that as adults the majority had 
made a satisfactory vocational and social adjustment (73, 78). A 
little less than a fourth were judged unsatisfactory in their adjustment 
because of educational backwardhess, shifLlessness and dependency, 
or delinquency and crime, riiis proportion is larger than that in the 
general population, but smaller than would have been expected if the 
children had been reared in the unfavorable enviromnenls from 
which they were frequently taken. Within the adopted group, a rela- 
tionship was found between the quality of care and child training 
provided by the foster home and the number of foster children judged 
to have made a successful adult adjustment. In the homes rated 
“excellent" in this regard, 87% of the foster children fell in the satis- 
factory adult category; in the homes rated “poor,” only 66% were so 
classified. 

On intelligence tests, foster children as a group lend to fall some- 
what below^ “own children” brought up in comparable homes (5, 22, 
78), but above the average of the general population. At least two 
factors may account for the latter dilTcrence in favor of the foster 
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group. First, placement agencies as well as foster parents tend to 
choose the most promising children for placement in foster homes, 
while the more poorly qualified lend to remain under orphanage or 
boarding-home care. Secondly, the same type of selection occurs with 
reference to foster homes, the more undesirable homes at the lower 
end of the distribution being disqualified for adoption purposes. Foster 
children as a group are thus reared in homes superior to the general 
average. It is also likely that foster parents, on the whole, have a rela- 
tively strong interest in children; otherwise they would not have gone 
out of their way to adopt one. 

Why are foster children less successful — in intelligence test per- 
forr^ance as well as in adult achie\emc:it — than other children reared 
in the same type of home? A number of psychologists put the burden 
of explanation upon unknown ‘liereditary innncnces." presumably 
this means }*enefiuilly tietennined strtu tuin! limiuitunis on behavior 
development. Such limitations may play a signilicant part in individ- 
ual cases, but little or no direct information is available regarding 
what they arc. Part of the explanation, on the other hand, may be 
provided by prenatal and natal environmental factors. Such conditions 
as diet and medical caic of the mother during pregnancy a.nd parturi- 
tion arc probably inferior, in general, for the f(>ster group, lhat these 
conditions may affect the structural development — and indirectly the 
subsequent behavioral development — of the child is being increasingly 
recognized (62, 71). On the basis ol extensive observations at the b'els 
Research institute, for example, Sontag writes: 

Contrary to earlier v>pinion -the piogress of a Ictus and of an infant 
is considerably inniicnccd by tlie quality the diet ol his mother during 
the gestation period. . . . thcic is increM /.ng evidence ol the tremendous 
importance of maternal nutistjon and variations in endocrine t unction m 
determining the physique, ph\siology, and progress of the neonate. It is, 
it seems to me, self-evitienl that the physical anu physiological adequacy 
of the neonate are in turn critical lactors in his cmaiional and social adap- 
tation during infancy and iherclorc throuHiout life (62, pp. 151-154), 

When the child has lived with his own family o"* in an institution 
for several years prior to adopti<m, the possible influence of such early 
home environment needs, of course, to be taken into account. A further 
factor to consider is the nature oj the family relatumship in foster 
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homes. The attitude of foster parents toward a child may differ in 
some essential ways from that of own parents. In some cases, the con- 
tact of foster parent and child may not be so close or intimate as that 
of a child and his natural parents. The child himself, when he knows 
of his adoption, may react differently toward his foster parents than 
he would toward his own parents. Social expectancy may also compli- 
cate the situation. Parents as a rule expect their own children to 
resemble them in intellectual and emotional development, and this 
expectation may be manifested in their beha\ ior toward the child, as 
well as in the attitudes of other relatives and associates. As the child 
develops, his observers repeatedly call attention to points of family 
resemblance, real or imagined; he is frequently reminded of ancestral 
characteristics, which are held up to him as his heritage. Social influ- 
ences of this sort arc absent or greatly minimized in the case of foster 
children. It would be diflicult to estimate what subtle motivational dif- 
ferences may arise as a result of such differences in social expectancy, 
and what effect the motivational factors may in turn have upon the 
subsequent course of intellectual development of the child. 

Fo.ster Cliihlren and the “Nature-Nurture'"' Question. To psychol- 
ogists, foster children have provided one more approach to a j*ossihlc 
determination of the contributions of heredity and environment to 
intellectual development, and a number of investigations have been 
especially designed with this pro^?lem in mind. Three mnjor types of 
analysis have been employed for this purpose: (1) comparison of 
foster family and own family resemblances; (2) study of the relation- 
ship between foster child’s iO and level or quality of the foster home; 
and (3) determination of change in child’s IQ following adoption and 
residence in the foster home. The four most extensive investigations 
are those conducted by F. N. Freeman and his associates at the Uni- 
versity of Chicago ( 13 ), Burks at Stanford (5 ), Leahy at Minnesota 
(35), and Skodak and Skecls at Iowa (59, 60, 61). Two of these 
studies emphasize the contribution of heredity and the other two the 
contribution of environment. A brief examination of their procedures 
and findings will show that their discrepancies arc more apparent 
than real. 

Burks (5) administered the Stanford-Binet to 214 foster children 
and their foster parents, as well as to 105 control children and their 
own parents. 1 he control group was closely equated with the foster 
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group in age of children and parents, educational and occupational 
level of the parents, and cultural characteristics of the home. All sub- 
jects were white, English-speaking, and quite homogeneous in national 
and cultural background. Each foster child had been legally adopted 
by a married couple, the two foster parents being alive and living 
together at the time of the study. Each control chiiil was likewise in a 
home in which both parents were li\iiig. Only foster children who had 
been placed in the foster home under the age of 12 inontlis were 
included, the a\erage age of placement being three months. At the 
time of testing, the children ranged in age between 5 and 14. 

The correlations between Stanford-Binet mental ages of the parents 
and IQ's ol the children, for both losier and control grcnips, arc pre- 
sented below. The correlations between child's IQ and a composite 
cultural index of the home are also given. 


Coftclation hi'twccn 


C hihis IQ and: 

roster 

Contfol 

F aihc'r's M \ 

.07 

.45 

Mother's MA 

.19 

.46 

Cullural index of home 

.25 

.44 


Since the resemblances in the control group, attributable to heredity 
plus environment, are consistently closer than those in the foster 
group, attributable to environment alone, Burks concludes that hered- 
ity is much more important than environment in determining individ- 
ual diflcrcnccs in intelligence. She estimates that 

The maximal contribution of the best home environment to intelligence 
is apparently about 20 IQ points, or less, and almost surely lies between 
10 and 30 points. Conversely, ihe Uaist euilured, least stimulating kind of 
American home environment may depress the IQ as much as 20 IQ points 
(5, p. 309). 

In the investigation by Leahy (35), the same general procedure 
was followed as in Burks’ study, with certain improvements. The Otis 
Self-Administering 'Fest (Intermediate Form) was substituted for the 
Stanford-Binet in testing the parents, this test being better adapted to 
the adult level than was the Stanford-Binet. The matching of the 
experimental (adopted) and control groups was done very meticu- 
lously, each adopted child being paired with a control child of the 
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same sex, age (within six months), father’s occupational level, and 
father’s and mother’s schooling.^ ^ The age of adoption was even lower 
than in Burks’ study, all children having been placed in the foster 
homes at six months of age or younger. 

All other conditions in both experimental and control groups were 
the same as in Burks’ study, except that Leahy’s foster group included 
only illegitimate children, while less than 80% of Burks’ group were ille- 
gitimate. This difference probably introduces certain selective factors. 
Illegitimate children come from families of varied socio-economic and 
intellectual background; whereas other foster children, adopted be- 
cause of incompetence or poverty of parents, or similar reasons, lend 
obviously to come from low'cr-levcl families. On this basis alone, ille- 
gitimate children as a group would be expected to be average in 
heredity. But it has also been shown (34) that the parents of those 
illegitimate children who are placed for adoption are of higher average 
educational and vocational level than those parents who retain their 
illegitimate children. The dilference probably results from the greater 
sensitivity to social disapproval among persons in the higher educa- 
tional and socio-economic levels, who would thus be more reluctant 
to retain an illegitimate child. This additional selective factor; might 
make the group of adopted illcgilimate children superior in heredity to 
the general population. It has also been found that those illegitimate 
children adopted at earlier ages tend to come from the highest socio- 
economic levels (34). 

All these factors suggest that a group of illegitimate children 
adopted before the age ol six months, such as that studied by Leahy, 
ought to be superior from the standpoint of heredity. Leahy’s group 
did in fact excel in fO, averaging 1 10.5. This average is obviously 
higher than that of the general population, and it is also higher than 
the 10 of other groups of foster children previously tested. 1 he dif- 
ference could logically be attributed to heredity, on the basis of the 
selective factors discussed above. But it should also be noted that a 
certain amount of selective placement occurs, the children whose owm 
parents are better educated and socially superior being routed by the 
placement agencies into superior foster homes. The intellectual supe- 
riority of such a group might thus be the result of their having been 
placed in better foster homes. 

^“The Iasi three categories apply to own parents in the control group and to 
fostei parents in the experimental group. 
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The principal comparisons made by Leahy, paralleling those of 
Burks, arc given below. 


Corrclaiion between 


C hiUI's IQ (iml: 

l<>\tcr 

Control 

Father's Otis score 

.19 

,51 

Mother’s Otis score 

.24 

.51 

Cultural index of home 

.26 

.51 


Various other comparisons were made which led the author to con- 
clude, with Burks, that heredity is the major inlluence in the deter- 
mination of intellectual level. 

In interpreting the correlalicms reported by Burks and Leahy, a 
number of factors nuisi be taken into consideration, f irst, as previ- 
ously mentioned, intra-family relationships may not be strictly com- 
parable in foster and own homes. The child's knowledge of adoptiem 
may alfcct his attitude toward liis foster parents and foster siblings, 
as w'cll as his self-conlidence and his accomplishment. In Burks’ 
group, 35^/^) of the children knew of their adc>plion, and in Leahy's 
group 50%. Lhe parents, of coarse, always know (>f the adoption, and 
their reactions toward the child may be aliccted by such knowledge 
in countless ways. Of some relevance in this connection is the finding 
that two unrelated foster children reared in the same home tend to 
resemble each other more closely in IQ than an adopted and an own 
child reared together. To be sure, the obtained differences in correla- 
tion are slight and the groups of subjeets available for such ct)m- 
parisons loo small for conclusi' rc^ults. But it is interesting to note 
the consistency of this finding in dilTerenl studies (5, l.T 35). 

A second consideration is the role of natal and prenatal factors, 
which has also been previously dis«. usscil Such laetors. although en- 
vironmental in nature, would tend to increase the resemblance of 
children to their own parents, as contTasted with their resemblance to 
foster parents. Mothers who are intellectually and socially or econom- 
ically inferior would also be more likely to prew idc inferior prenatal 
care through ignorance, irresponsibility, or poverty. It might seem 
that prenatal environmcritai factors could \:M account for resemblance 
to own fathers. More careful consideration, however, will show that 
the father’s educational, vocational, and economic level will also in 
part determine the quality of medical, dietary, and other conditions 
affecting the mother. 
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A final point should be noted. In several respects, Leahy's study is 
better controlled than Burks’ and consequently its results should be 
regarded as more conclusive. There is, nevertheless, one serious dis- 
turbing factor in Leahy's data. There arc several indications that, 
despite the care with which the foster and control groups were 
matched in parental education and occupation, the cultural levels of 
the foster and control homes were not truly comparable (74). I'hc 
average over-all environmental rating was 137.9 for the foster and 
118.7 for the control homes; the corresponding vSD's were 54.3 and 
59.6, respectively. In other words, the fosier homes were on the 
whole superior and more uniform than the control homes. Such uni- 
formity would obviously serve to decrease the contribution of home 
environment to individiial differences in 10. In fact, if home environ- 
ment plays a significant part in intellectual development, we should 
expect the foster children in this group to be more alike in 10 than 
the control children. Such was indeed the case, the foster JO's having 
an SD of 12.5 and the control an SD of 15.4. Further evidence for the 
greater cultural homogeneity of the foster homes is furnished by the 
average environmental ratings of homes in different occupational 
categories. The foster homes in which the father is a semi- stilled or 
a day laborer arc much less inferior than arc the control homes in the 
corresponding occupational category, the average environmental rat- 
ings being 74.7 and 40. L rcspec^tivcly. Apparently, matching father's 
occupation was not a suflleicnt control, since within the same occupa- 
tional level those homes wliich arc approved for adoption purposes 
tend to be at the upper end of the distribution. What is even more 
imp(^rtant is that homes in different occupational categories were 
more alike in the foster than in the control group. It is not surprising, 
in view' of this situation, to find that the lO’s of the foster children 
also differed less from one occupational category to the other than 
did the lO's of the control children. 

The investigation of Freeman et al. (13) employed a wider variety 
of approaches but was less well controlled in certain important re- 
spects than the Burks and Leahy studies. One of its principal weak- 
nesses is that the age of adoption was much higher, averaging four 
years for the entire group of 401 foster children tested. The subjects 
were somewhat more heterogeneous in national, racial, and socio- 
economic background than in the other two studies. The children 
were tested with the Stanford-Binet and the International Group 
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Mental TestJ'^ the foster parents were given the Otis Self-Administer- 
ing Test (Higher Form) and a specially constructed vocabulary test 
covering many fields of knowledge. Field workers collected data on 
the education, occupation, and cultural level of the foster parents, as 
well as on the condition of the foster home. Information regarding the 
natural parents ot the foster children was obtained whenever possible 
through visits, interviews, and examination of case records. 

In order to facilitate various comparisons, the children w^cre classi- 
fied into four overlapping groups, and the data analyzed separately 
for each group. Group I. the pfotest ^^ronp, consisted of 74 children 
who had been tested before adoption and who had lived in the same 
loiter home until the time of the second examination. \ he average age 
of these children at adoption was eight years and their average pciiod 
of residence in the foster home at the time of the study was four years. 
After residence in the foster home, the average 10 of this group 
showed a small but fairly reliable gain from 91.2 to 93.7.'*^ When this 
group of 74 children was divided into those adopted in^) the belter 
and those iKJc'jplcd into the poorer foster homes, the former showed an 
average rise of 5 IQ points, while the lattei sliowed no change. Simi- 
larly, those children adopted earliei showed more improvement than 
those adopted later. Allhough a^'e^)ge gains w^ere small, the results of 
these various comparisons tended to be mutually corroborative. 

Group II, the sibling ^ronp, was composeil of 125 pairs of siblings, 
each adopted ink; a dilfcrent foster iiornc and separated for a period 
4 to 13 years. \ he a\erage age at winch the siblings were first sepa- 
rated v/as 5 years-4 months. I ci‘»;iirasi to the correlation of about 
,50 ordinarily found hetween sibling.*, reared in the same home, the 
IQ’s of tiicse separated siblings correlat*, i only .25. 1'hc scores of 63 
siblings adopteii into homes which recei’ jd significantlv dilTerenl cul- 
tural ratings correlated only .19; those of siblings adopted into similar 
foster homes correlated .30. Ihcsc discrepancies in correlation are all 
the more impressive when it is recalled that the siblings had lived 
together during the important years of early childl-ood. 

The third group included all /o.s/cr sih'inys, i.c.. two unrelated chil- 
dren li\ ing in the same home. This in turn was subdivided into a group 

’■’A non-lurigiiage and rdatively “cCuurc-fiec” intelligence lest (cf. Ch. 21). 

^ mean gam is slightly l.imci than 3 limes its piobablc cr«or. Freeman esti- 

mates that the net avciagc gain is 7 5 IQ pomls after allowance is made for in- 
accuracies m the sUndardi/aluui of ihc form of tbv Sianfoid-Binct in use at the time 
of the study. 
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of 40 pairs consisting of a foster child and an own child of the foster 
parents, and a group of 72 pairs of unrelated foster children. In the 
former, a correlation of .34 was found between the IQ’s of the two 
children in each pair; in the latter, the correlation was .37. It will be 
noted that these correlations are higher than those between true 
siblings adopted into difTcrent foster homes. 

Finally, all the children were included in one composite group of 
401 cases. This composite, labeled the ho)ne group by Freeman, was 
employed chiclly in making general comparisons bet\\cen the foster 
child's intellectual and social development, on the one hand, and such 
factors as foster parents' intelligence and cullural level of the foster 
home, on the other. In the entire group, a correlation of .4«S was 
found between child's IQ and cultural rating of the foster home. This 
correlation rose to .52 when only children adopted under the age of 
two years were included. Presumably these children, having lived in 
the foster home from a younger age, showed the inlluence of the foster 
home more clearly in their intellectual development. The correlation 
of child's IQ with foster fathers Otis score was .37 (N FSO), and 
with that of the foster mother .28 (N - 255 ). These and other similar 
correlations suggest the importance of home environment in intellec- 
tual development.''* 

1 he principal ditlieultics in the way of an unambiguous interpreta- 
tion of the findings of the Chicago study are the selective phiceiuenf 
of foster ehildreri and the possible unreliuhilify of the initial IQ's in 
the pretest group. Ii is a WTll-known policy of placement agencies to 
“fit the child to the home.'’ Moreover, the more intelligent foster 
parents may themselves be more concerned with the intellectual level 
of a child whom they are considering for adoption. While the less 
inlelhgenl losier parents would hardly demand or choose a less intelli- 
gent child deliberately, they may be less concerned with intellectual 
level and may base their decision primarily upon other considerations. 

Freeman and his co-workers looked into the possibilities of selective 
placement in theii study and were inclined to minimize its cITecl. The 
adoption records show^cd, for example, that health, sex, race, and 
physical appearance were specified by the foster parents in their appli- 
cations much more often than intellectual level. When the latter was 

’‘The specific lesiilts citcil in the above discussion arc all based on the Slan- 
lord-Binet and ihe Otis tests. The International and the vocabulary tests yielded 
closely similar results in all the cases in which they were employed. 
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mentioned, it was only to request that the child be ‘^normal/’ and this 
request was made as often by the less intelligent as by the more intelli- 
gent foster parents. Moreover, in over 80% of tlic cases no intelligence 
test scores were available for the children or for their natural parents. 
Despite these findings, it should be borne in mind that the placement 
agency could still use other knowledge about the children to estimate 
their relative intellectual level. Data on the education and occupation 
of the child’s own parents were probably employed in many instances 
in choosing a ‘Suitable" foster home (cf. 34). In addition to delib- 
erate attempts to place the more '‘promising" children in the better 
foster homes, a certain amount of unsuspected selection may have 
occurred through the factors of illegitimacy and age, as previously dis- 
cussed. Younger foster children, it will be recalled, tend to come from 
superior families. At the same lime, superior foster families more 
often request and adopt younger childien. rreeman, for example, 
found a correlation of .27 between the cultural r.iling of the foster 
home and the age of adoption of tlie child. We may well ask, then, 
what accounts for the higher IQ’s of children in the better foster 
homes. Arc they brighter because their parents were more intelligent, 
or because they have been reared in a superior foster home? Since 
the two factors cannot be isolated under existing adoption practices, 
the question cannot be camclusively answered from the data at hand. 

With reference to the gain in 10 shown by the children in the pretest 
group, it has been suggested (20, 54) that the initial score may have 
been spuriously lowered by emotiona! stress. !f the child is tested 
while living in an institution or h. )arding home, or shortly after arrival 
in the new foster home, he is likely to be in a period (4 uncertainly 
or readjustment. The emotional condition of the child at such a time 
is probably not conducive tc' his best pcDormanee on an intelligence 
test. 1 he rise in score upon retesting after several years of residence 
in a single foster home may thus simply re (led the child’s better 
adju.stmcnl to the home situation and his greater freedom from 
anxiety and other unfavorable emotional condition-. 

A longitudinal approach to the study of foster child intelligence is* 
represented by the long-range project c uiduclcd by Skodak and 
Skecis at the University of Iowa (55, 59, 60, 6t ). Slanford-Binct 
lO’s were determined periodically on an original sample of 306 chil- 
dren placed in foster homes under the age of 6 months and legally 
Kiihlriiarui-nmet \\as at ages 3^2 or r 
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adopted I he large majority of the children were illegitimate The 
average age of placement was slightly under 3 months, at the time of 
the tirst examination, the age of the group ranged ti om 1 Vi to 6 years 
and averaged 2 years Retests weie made at a\crage ages of approxi- 
mately 4, 7, and 13 years, the number of children available tor the 
last retest being 100 The aveiage IQ of this group of 100 on eaeh of 
the retests is shown below 

Avtiage ngt 2 4 7 H 

A\uagt IQ 117 112 114 107 ( 1 M6 St intord Bintl) 

1J7 (19 7 StinloidBmct 1 oim 1 ) 

In the n-ycai retest both the earlier (1916) and the revised ( 1937) 
form of the Stanloid Binet were used, smee the older iorm had betn 
used at the younger ages I or the most reliable and \ ilid measure ot 
the childicn s intellectuil lc\el, howe\er the lesults wjlh Ihi ilm cd 
loim should be eon^^idered 1 he eiiliLr torni is likcl\ to undvrcsti- 
mate the mtelhgenct ol older ehildren, owing to eeitain inaecur^siLS 
oi standaidi/ation 

Initial IQs ol thw lo lei children showed negligible oi ^cio eorxth- 
tions will! intelleelinl cduLationil or oeeiipalional level ol either 
lostei parents oi true purdPs (when inlorni ition w is a' iil ible on the 
latter) Witli ineiea in » ige howevei th^ e-oirelation between ehihl s 
IQ and hue motlKi ^ IQ »v)se to -44 (6^ cases) The low correlations 
With early IQ m i\ have ^e^ulled from the unreli ihility ol pie ehool 
testing and tiom the n ituic ot the funebems tested at thost agi s As 
the tests bee uiic moie Vcrbi! in(t lb^tl let nn i less sensori-molor \ 
nituie the.y became bettd me isuies ol sdioh tic iptitiide' ci intcili- 
geiiee (el Ch S) 1 lie le^'cmbiintc oi tnese ehildim to their line 
nntOers m iv lojieiHv tx the lesiUiP herv,(litiiv slnicturallv imposed 
limit itions ol devcKp'ikPt oi ol picnitil environment ! Kkt)is It 
ma> alM howevei be th^ le^-ult ol selective placement, sinec the 
ehildien of the better educated parents were, in t let, pi iced in the 
better loster homes ' 

In their interpretation of these findings Skodak and Skecls phee 
the major emphasis upon the rel itively high average IQ ot the lostt r 
eliildren m the light eP ihe low intellecttnl status oi their line parents 
Ihcsc interpretations have been the eentei of much eontioversv (19, 

^ 1 he corrtlitifn {( a) b<.i\VLtn nut molhors IQ nul octiip ii on il stitis of the 
lostti fithti w IS (41) 
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41 ). The highest grade reached in school by the true mothers and true 
fathers (when data were available) docs not appear to be signifi- 
cmitly below the average for the general population. Since the major- 
ity of the children were illegilimale and all were adopted at an early 
age, we should expeet them to be a relatively superior group. On the 
other hand, the authors point out that the schooling data tend to 
overestimate the true patents’ ability, since many had been doing 
poorly in school, were old for their grade, etc. Moreover, the IQ's of 
80 true motiicrs. described as representative of the entire group, aver- 
aged 93.'^ Occupational, economic, and social status of the true fami- 
lies was quite low, many of the lamilies being on relief. On the whole, 
the IQ's of the foster childien do seem lo be higher than might have 
been expected from what is kiKwvn of iltcir true lamily background, 
but little moic can be concluded. 

Foster Cliildren of Feebleminded Mothers. Probably the princi- 
pal source of contention in the Skodak-Skeels study was the authors’ 
statement that the children of Iccblcmindeti mothers were indistin- 
guishable from the rest of the group in their subsequent intellectual 
development in the foster homes. Although only 16 ol the true 
mothers who were lested fell clearly vvitinn the leebicminded range, a 
certain measure of corroboration is to be found in the data of other 
investigators. In the previously cited Chicago study (13;, 86 foster 
children of meniaily defective mothers, adopted under the age of 
5 yeais, had an average IQ of 95.1. In a group studied by Speer 
(63, 64), the IQ’s of 12 children of feebleminded mothers, placed in 
boarding homes under the agv of 3. averaged 100.5. In the same 
study, 16 cliildren vvho hdd remained with ihcir Iccblemindcd mothers 
until they were from 12 to 15 years old had an average IQ of 
only 53. ] . 

In a study conducted by Stippich (69) at the Ujiiversity of Minne- 
sota. 48 children of feebleminded mothers were compared with 29 
children of normal mothers, all 77 children having been placed in 
boarding homes or inslilutions bclore the age C‘i one year. I'he last 
intelligence test given llicsc children, at average age of 4*/i years, 
showed a mean KJ of S9.38 lor the “experimental” group (with 
feebleminded mothers) and 103.63 for the control grc'up. It should 
be noted, first, that this study is in general agreement with those pre- 

‘^Oiiginally rcpoiU*d as 87, but -Ahi-n coriected by using adult CA of !5 rather 
than 16, a mean ol *^3 found (41 ). 
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viously cited in finding that, when the children of feebleminded 
mothers are reared in normal homes, their IQ’s average considerably 
higher than the IQ’s of their mothers. The mean 10 of the mothers 
in this group was about 61, with a range from 32 to 77. Among the 
children of these mothers, 15 had IQ’s above 100, and only 6 below 
70. The children undoubtedly included some, at the lower end of the 
distribution, whose feeblemindedness resulted from unidentified struc- 
tural deficiencies which limited their behavioral devek^pment. Such 
structural deficiencies, of course, could themselves be determined 
either by hereditary factors or by prenatal or natal environmental 
factors. 

Certain other findings in the study by Stippich are of interest. Most 
of the children in both experimental and control groups showed a rise 
in 10 from early to later tests. These data thus lend no support to the 
argument that the children of feebleminded mothers nia}/ ‘‘fall be- 
hind” as they grow older and are measured with more reliable tests 
of intelligence. That both groups made a poorer showing than most 
foster groups is understandable when we consider that none of these 
children were adopted, but rather that they were placed in boarding 
homes. Such homes are generally of lower socio-economic Itwel tlian 
foster homes, the children often being taken for boarding in order to 
supplement the family income. The interest and attention shown 
toward the child are usually less than in the case of an adopted child. 
Moreover, most of the children were shifted about from home to 
home, or from institution to home, the number of different place- 
ments per child ranging up to 9. Such a situation is not conducive to 
good adjustment or optimum intellectual development. As for the 
comparison betw^’cn the final IQ's of experimental and control groups, 
it seems likely that some selective placement occurred and may ac- 
count for part of the differences in IQ’s. The author maintains that 
there was no tendency for the agencies to place children of feeble- 
minded mothers in lower-level boarding homes. The distributions of 
occupational levels suggest, however, that some such tendency may 
have operated. For example, 14.6% of the experimental and only 
8.2% of the control placements were in homes classified in the next 
to the lowest occupational category; 1.8% of the experimental and 
none of the control placements occurred in homes in the lowest occu- 
pational category. Moreover, it is possible that in other aspects of 
the home environment, not indicated by the crude occupational cate- 
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gories, the boarding homes in the experimental group may have been 
inferior. Thus even within the same occupational category the board- 
ing homes in the experimental group may have represented the lower 
end of the scale. 

Tn criticism of some of the Iowa and Chicago results, it has been 
pointed out (41) that the mothers may not have been truly feeble- 
minded, but that their IQ’s may have been spuriously lowered because 
of testing conditions. If, for example, the mother is tested shortly 
before or after the birth of an illegitimate clnlJ, her emotional condi- 
tion may not be conducive to good performance on an intelligence 
test. On the other hand, the results were no diiTerent when only 
mothers who had been insliUilionali/ed as feebleminded were included 
(63, 64, 69). Interpretation of the results of any of these studies is 
dilhcult, however, without infornuilion regarding the father’s intellec- 
tual status. It can be argued that the child may have ‘diormal heredity” 
if his father is normal, even though the mother wxis defective. Still 
another possilile explanation is that the mothei's feeblemindedness 
resulted from either prenatal or postnatal enviromnental factors, and 
that no hereditary factor — which could be transmitted to the child — 
was involved. 

Concluding Evaluation of Research on Foster Children. In eval- 
uating the contribution which the study of foster children as a whole 
has made to the analysis of heredity and environment, four major 
points merit consideration. First, all investigators agree in finding that 
inteliecliial development is alTcclcd, to a greater or lesser degree, by 
the type of home environment 1 which the child is reared. Secondly, 
the existing conditions of adoption make a more precise analysis of 
contributing factor^^ imposMblc. I here a: too many unknown or un- 
controlled variables whose influence cannot be isolated, 'fhirdly, the 
study of foster children is not — as has frequently been implied — a 
technique for comparing the relative contribution of “heredity” and 
“environment.” It is at best only a means of investigating the inllu- 
ence of one phase of environment, namely, the tyoe of home in which 
the individual has lived for a certain numlvT of years (often a rather 
small number!). Other important aspects of the environment arc not 

’■’That the home enviionments ol the two gioups were not fully eornparahlc 
seems qi.iie clear. The cxpenmciual group iFicliided, for example, one child who had 
spent the first nine months of his life with Ins fcchicmmded mother and the rest of 
the time in an nislitntion for the teLblcmindcd, never having bee i m a heralding 
liome at all! 
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covered. Schooling, for example, is relatively uniform for all individ- 
uals in the group. It would thus serve as an equalizing influence, 
tending to reduce the effects of varying home environments. Other 
influences outside the home, including community groups, organiza- 
tions such as the Boy Scouts, and the like, arc also probable equaliz- 
ing factors. Prenatal and natal conditions are another set of environ- 
mental influences which arc not considered in these studies. It would 
thus be misleading to regard the foster children studies as indicating 
the contribution of ‘’the environment” to individual differences. They 
can only show the relative contribution of one restricted aspect of 
environment, as against all other combined influences of environment 
as well as heredity. 

Finally, because of placement policies and practices, even that 
phase of environment which is investigated, viz., parental and nomc 
status, is artificially restricted. If the total range of American homes 
were covered, reaching down to the most deficient, then the observed 
effect of home environment upon intellectual development would 
probably be greater. Extending the range still further, to include other 
cultures where the standard of living is lower, would increase the 
relative contribution of home environmcnl even more. It is al:iO doubt- 
ful whether those aspects of the child’s home environment which arc 
most important for intellectual development have been adequately 
covered by the ratings of home environment which have been em- 
ployed, An index which concentrated exclusively upon the most rele- 
vant characteristics miiflit yield a higher correlation with IQ. 

INSTITUTIONAL HNVIRONMLN7S 

Closely related to the analysis of foster family relationships is the 
study of children reared in institutions. Despite the apparent uni- 
formity of their institutional home, such children generally show 
nearly as wade individual differences in intelligence as children living 
in their owai homes. Moreover, in one investigation conducted in 
England (32), correlations m the .20’s and .30's were found between 
the intelligence test scores of orphanage children and the occupational 
status of their own fathers. It should be noted that these children 
were placed by the institution in boarding homes until the age of 6. 
From 6 to 16 they lived at the orphanage, where they attended the 
same school. Since the occupations of the fathers were known to the 
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orphanage staff, one wonders to what extent selective placement and 
selective treatment in the institution may have artificially raised the 
reported correlations. It was also found that, among children admitted 
to the orphanage before the age of 3, intelligence test scores showed 
a lower correlation with parental occupational level than in the case 
of children who remained with their parents after the age of 3. In 
another British study (28) on orphanage children aged 9 to 16, a 
similar relationship between child\ intelligence and parental occupa- 
tion was noted. But the intellectual dillerentiation between occupa- 
tional classes, as well as the extent of individual differences within any 
one class, tended to decrease as length of institutional residence 
increased. 

A fairly well-established finding is that orphanage children on the 
whole have lower IQ’s than those reared in either boarding homes or 
foster homes (12, 17, 36. 57, 78). In itself, such a finding permits of 
at least two explanations. First, selective laefors may gradually elimi- 
nate the brighter childien from an orphanage group, since such chil- 
dren are the most likely to be chosen ior adoption. Secondly, institu- 
tional cnviroiunents in general are relatively unslimulatiiig to the 
developing child. Orplianagcs vary widely among themselves, of 
course, in the type of cnviionment wliich they piovide. Problems of 
overcTOW^ding, staffing, space, equipment, and other tacihties natu- 
rally produce differences in the amount and type of stimulation which 
the child receives. The ratio of adult members to children varies 
in different orphanages from about 1:2 to about 1:25 (cl. 78). To a 
certain extent, tlicse difference, in iastiUilionai environments arc re- 
flected in the IQ's of the children. Some of the apparent^ inconsistent 
results found by investigators in diffcrc^n orphanages arc probabiy 
attributable in part to such insliluiional ddlcrcnecs. 

In a study of infants between the ages ol 6 and 12 weeks, Ciilliland 
(17) compared the performance of over 300 institutional infants with 
an equal number of infants living in their own homes. The IQ’s of the 
institutional infants on the Northwestern infant Imelligencc Test aver- 
aged significantly lower than those of tlr infants in private homes.-^^ 
Of the 40 items in Die test, 18 showed a significant difference in favor 
of the infants in private homes. These items dealt with behavior which 
would be influenced by the nature and extent of the child’s contacts 

The dilTcrcnce between ♦hese ;neans was siynilicant at a hj»;h level of con- 
iidence, the crtical lalio being ovci 4 
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with his social and physical environment. Items concerned primarily 
with maturational changes showed no difference between institutional 
and non-institutional groups. 

Several of the Iowa investigations were concerned with orphanage 
children. In one orphanage which offered relatively little opportunity 
for intellectual development, a nursery school was set up for a trial 
period (58). Some evidence was found that the lO’s of the nursery 
children tended to rise, while those of the other orphanage children 
dropped somewhat. LI n fortunately, long-rarge comparisons could not 
be made satisfactorily in this study, since children were eliminated at 
frequent intervals from both groups, owing to adoption, and others 
were admitted. 

In another widely quoted Iowa study (56, 57), 13 orphanage chil- 
dren under 3 years of age, with lO's ranging from 35 to 89, were 
placed as “guests'’ in an institution for feebleminded women, one or 
two children being placed in each w\ard. I'herc were about 30 women 
in each ward, ranging in chronological age from 18 to 50 and in men- 
tal age from 6 to 12. Despite their own intellectual backw^ardness, 
these women had higher mental ages than the children, of course, and 
were thus able to provide considerable intellectual stimulation for 
them. Together witii the ward allendanls, the feebleminded women 
evidently lavished attention and affection upon their 3^oung visitors. 
After about 18 months of this regimen, the infants gained an average 
of 27.5 10 points, while a control group which had remained in the 
orphanage lost an a\crage of 26.2 points. The contrast between these 
two groups is undoubtedly cKaggcraled by the regression effect, which 
is considerable at these age level.N owing to the unreliability of infant 
tests (cf. Ch. 8). The '‘experimentar’ group had an initial average 10 
of 64.3. while that of the control group was 86.7. Through regression, 
the initially lower group would be expected to rise somewhat and the 
initially higher to drop, since they were both samples of the same 
orphanage population. It is unlikely, however, that the entire differ- 
ence can be attributed to regression. 

A wx^rd should be added regarding the apparent contradiction be- 
tween the above findings and some of the previously cited results on 
children of feebleminded mothers. It will be recalled that those chil- 
dren who remained w'ith their feebleminded mothers tended to lose 
in IQ with age, while those who W'erc placed with more intelligent 
foster parents seemed to develop normally. If the specific stimulating 
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conditions in the two types of studies arc considered, it will be seen 
that the situations are only superficially similar and that no real con- 
tradiction exists. A home conducted by a feebleminded mother is 
probably poorly organized and inefficiently run. The mother herself, 
finding it difficult to cope with the everyday problems of living, may 
have little time or energy left to devote to the child. The emotional 
atmosphere, too, may be unfavorable in many such homes because of 
frustrations, economic difficulties, irresponsibility, and similar con- 
ditions. The institutionalized feebleminded girls, on the other hand, 
were free from other responsibilities, had little else to keep them occu- 
pied, and enthusiastically welcomed the diversion of caring for and 
entertaining the one or two babies in their ward. The amount of adult 
attention v.hich the child receives would thus be quite different in the 
two situations. 

The important part which attention from adults plays in the intellec- 
tual and emotional development of the child is being increasingly 
recognized. An interesting demonstration of this fact was provided 
by a comparative study of the development of infants in two Euro- 
pean institutions (65). One of the two, described as “Foundling 
Home," was an ordinary orphanage in which hygienic and medical 
care was excellent, but adult contacts and other forms of stimulation 
were at a minimum. The infants in Foundling Home were kept iso- 
lated in cots, with no toys or objects other than bedding and clothing. 
There was very little for the child to see and practical!) no oppor- 
tunity for locomotion. Essentia! physical care was provided by a 
trained nursing staff, each nurse having charge of eight babies. The 
other institution, designated as “Nursery,” was established to care for 
the new-born babies of delinquent giiis in a penal institution. It was 
closely comparable to Foundling Home in its physical and medical 
facilities, but the Nursery children were less isolated, had a certain 
number of toys, and were cared for by their mothers under the super- 
vision of the nursing staff. The investigator g'ves some evidence to 
show that in terms of parental background the children in Foundling 
Home were supeiior to those in Nurse y. 

Both groups were examined periodically with the Hetzer and Wolf 
Baby Tests. The initial developmental quotient, based upon perform- 
ance during the first four months of life, averaged 101.5 for the 69 
Nursery children, and 124 for the 61 children observed in Foundling 
Home. By the end of the first year, the developmental quotient of the 
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Foundling Home group had dropped to 72, while that of the Nursery 
group averaged 105. Subsequent observations showed that the Nursery 
quotient remained close to normal, while that of the Foundling Home 
children continued to drop, reaching an average of 45 by the end of 
the second year. It should be noted that while the Foundling Home 
environment represented an extreme degree of isolation and lack of 
stimulation, the Nursery group received more attention, on the whole, 
than children in the typical family situation. For the mothers in the 
delinquent institution, the care of their child was one of the few 
sources of satisfaction and pride. 1 he large majority of these children 
thus received an excessive amount of adult attention. 

A considerable proportion of the infants in Foundling Home devel- 
oped w'hat appeared to be a clear-cut clinical syndrome, including 
extreme depression, retardation in all behavior development, and — 
in severe cases — complete withdraw'a! and immobility (65, 66). In 
its early stages, this condition could be improved by returning the 
mother to the infant or, if that was impossible, by placing the child 
where he was free to move about and had contact with cither children 
and adults, if, however, the depressed condition was allowed to con- 
tinue unchecked for about three months or more, the child bailed to 
respond to such changes in treatment, the damage to its behavi('»r 
development appearing to be permanent. A combination of emotional 
and intellectual deprivations is apparently involved in the extreme 
conditions represented in these observations. 

Psychiatric disturbances among children reared in an institutional 
enviionmcnt during the hrst year of life have also been reported by a 
number of other investigators.-^ Ribblc (50) has repeatedly called 
attention to two g^encral types of reaction which commonly dcvelr^p 
among institutionali/ed infants who have recci\ed insiinicient “psy- 
chological mothering.” Some sliow a sort of negativism, which may 
include loss of appetite, failure to assimilate food, muscular tension 
and rigidity, and violent screaming. 1 he other type of reaction, which 
Ribble characterizes as t egression, consists of excessive depression, 
quiescence, and inactivity amounting almost to stupor. This condition 
often leads to a ‘'wasting away” despite adequate diet and phydeal 
care. 

Because of methodological dilhculties, small number of cases, and 

For reference to a nunibci of these studies, cb Spitz (65), Goldfaib (18), and 
Ribble (50). 
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similar limitations, the studies on institutional environments can do 
little more at this stage than provide promising leads for future re- 
search. The bulk of their evidence docs, however, point in the same 
direction as the results of other types of investigalions. All such evi- 
dence indicates that certain aspects of the child's home environment 
may exert considerable influence upon his subsequent behavior. A 
close relationship between the child and one or more adults appears 
to be an important prerequisite to both emotional adjustment and 
intellectual development. The data on language development, reported 
in the section on twins, should also be recalled in this connection. 
Similarly, a comparison of the language development of preschool 
child! cn living in their own homes with that of preschool orphanage 
children favfU'cd the former group (43). The children reared in their 
own homes excelled in size of vocabulary and in the variety of sub- 
jects about which they talked. These differences persisted when sex, 
chronological age, and mental age were held constant. Among the 
environmental factors mentioned to account for the observed differ- 
ences are the adult-child ratio and the number and variety of experi- 
ences about which the children can talk. 
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liodily Co nJHIons 
and Behavior 


Rizpfatfd RFI ERENcr HAS pr.PN MADE in ihc preceding chapters to 
the possible limits of behavior development set by the imiividuars 
structural characteristics. Age changes in behavior provided some 
illustrations of such limitations. Until the infant Iras attained a pre- 
requisite level of sensory, neural, and muscular de\eiopment, for 
exan.ple, certain specilic behavior functions may be very difficult or 
impossible to learn. Similarh , the physical deterioration in senescence 
is likely to curtail many of the person's activities. Physical dilTcrcnccs 
among individuals may likewise contribute to the obscived individual 
dilTerences in behavior. It is apparent that extreme sensory defects, 
for example, can s(> serious!}- lianoicap the individual that even special 
training may not bring him up to a normal level of performance. 
Other parts of the reacting organism may. through cither their defi- 
ciency or their superiority, alTcct the development of psychological 
traits. Certain forms of fecbleminde Incss are undoubtedly traceable 
to structural defects wffiich prc\cnt l:,c attainment of a normal level 
of behavior deveiopmeni, despite adequate stimulation. 

It should be clearl} recognized, however, that any structural char- 
acteristic whose inllucncc on bchavii^r development is demonstrated 
serves as a naessavy but not a snfhdent co'idithm in such develop- 
ment. In other words, liie presence c the prerequisite physical factor 
does not in itself determine behavior, but simply makes a certain kind 
of behavior development possible, ij the proper stimulation is avail- 
able. Moreover, structurally imposed limitations arc probably less 
elfective than is commonly supposed, since individuals rarely attain 
the degree of development set by their physical capacity. Physiologi- 
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cal and biological conditions may thus be regarded as “participating 
factors” in psychological reactions, rather than as the underlying 
determinants of any behavior function.^ 

The relationship between structural and psychological characteris- 
tics is of special interest for an analysis of the contribution of heredi- 
tary factors to behavior. In so far as hereditary factors affect behavior, 
they must do so through their control of structural development. 
Obviously the individual does not “inherit” functions as such. The 
study of any correspondences between behavior differences and dif- 
ferences in bodily condition would thus seem to be a necessary first 
step for a realistic consideration of the role of heredity in behavior. 

There are a number of well-known types of intelicclual and emo- 
tional disorders which are directly traceable to extreme glandular 
malfunctioning, the deterioration of tissues, the effects of drugs or 
infections on the nervous system, and similar abnormal conditions. 
The behavior symptoms associated with paresis, delirium tremens, oi 
cretinism, for example, can be clearly related to the physical effects 
of syphilitic infection, alcohol, or thyroid deficiency, respectively. 
Many other similar illustrations could readily be cited. The present dis- 
cussion, however, is not concerned with these extreme and p?flhologi" 
cal conditions. The question before us now is essentially this: “ To 
what extent are individual differences in behavior associated with the 
structural differences commonly found within the normal range of 
variation?" 

CRANIAL AND CLRLRRAL ML ASURL iVItNTS 

Popular interest in the size and shape of the shull was considerably 
stimulated by the pseudo-science of phrenoUtgy, initiated by Gall in 
the last years of the eighteenth century. Phrenology was based upon 
a false ntUion of the functions of the various parts of the cerebral 
cortex. The phrenologists maintained that each area of the brain con- 
trolled a particular intellectual or moral function, such as mechanical 
ingenuity, veneration, domestic impulses, and other equally complex 
and vaguely defined activities. They asserted further that the over- or 
underdevelopment of such behavior characteristics could be diagnosed 
by examining the protrusions of the skull. The location of a particular 

^ For a scaiching and systematic analysis of the role of structural factors in 
psychological functioning, cf. Kantor (40). 
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“bump” was taken to mean that the function allegedly controlled by 
the corresponding cortical area was highly developed in the given 
individual. 

It would seem unnecessary to refute such an obviously untenable 
doctrine were it not for its enduring popularity among the general 
public and its lucrative practice by a considerable number of charla- 
tans. In the tirst place, phrenology is founded upon the erroneous 
assumption that there is a close correspondence between the shape 
of the skull and that of the brain. Such a correspondence is hardly to 
be expected, in view of the ccrebro-spinal iluid and the several layers 
of membrane which intervene between brain and skull. It should also 
be noted that size does not provide a satisfactor> index of degree of 
development within the nervous system. It is the complexit> of interre- 
lation of the minute nerve cells and other microscopical characteristics 
of nerve matter that are probably related to etheiency of function. 
Moreover, the type of trait which phrenoh^gists ascribe to ditlcrcnt 
brain areas is quite unlike the functions discovered through investiga- 
tions of cortical localization. Connections have been demonstrated 
between certain muscle groups or sense organs and specific brain 
areas, but this is a tar cry from the localization of “literary propen- 
sities” or “love of dumb animals'' on the cortex! 

Phrenologists have also tried to show that cranial capacity as a 
whole, or total brain size, is related to intelligence. Their evidence for 
this, as for their other assertions, is based upon selected examples 
and is therefore worthless. It is true, for cXiimplc, that a certain type 
of idiot — the mierocephalic -ha» a very small skull. But there are also 
idiots with normal or very large skulls. A few men oi genius may be 
lound with very large brains,- but others arc likewise found with 
smaller than average brains. The question can he settled only by pre- 
cise measurement of laige numbers of unselected cases. 

Invcstigaiions on the relationship between cranial capacity and 
intellectual achievement have generally yielded negative results. In a 
number of early studies in which average cranial dimensions of bright 
and dull groups were compared, the Jala are ambiguous and diflicult 
to interpret." The diflerenccs betwee:^ the averages are always ex- 
tremely small and occasionally inconsistent from one comparison to 
another. In certain investigations the measures taken on the living 

- A lavonic example is Daniel VVcbsicf, whose head ciicumlerence measured 
241/2 Jiiches. 

^ For a survey ot these data T. Paterson Ch. 3). 
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skull were not good indices of brain capacity. The groups employed 
often varied widely in age. When children are included, this may pro- 
duce a spurious relationship between size of head and intelligence, 
since the older subjects will have larger heads and at the same time 
will obtain higher scores on intelligence tests. Finally, the estimates of 
intelligence were frequently crude and unreliable. 

The first well-controlled study on cranial measurement in which 
adequate correlational analysis was employed is that of Pearson (66). 
Measures of head length, head breadth, and cephalic index were 
obtained on three groups, including 1010 Cambridge University stu- 
dents, over 2200 12-ycar-old school boys, and over 2100 12-year-old 
school girls. It will be noted that age was held constant among the 
children by selecting only 12-year-olds. The subjects were classified 
into intellectual levels on the basis of teachers’ ratings and scholastic 
records. The c(^rrclations between intellectual level and cephalic index 
were —.06, —.04, and .07 among the university students, school boys, 
and school girls, respectively. For length of head, the correlations in 
these three groups were .11, .14, and .08, and for breadth of head 
.10, .1 1, and .11. These correlations speak for themselves, being too 
low in every case to indicate any appreciable trend. The ^'ry low 
and inconsistent correlaticms with cephalic index lend no support to 
a frequently proposed theory that the '"long-headed” individuals (with 
a low cephalic index) arc the Biorc intelligent. Nor do they support 
the opposite view, also occasionally voiced, that the “broad-headed” 
(with high cephalic index) are the more intelligent. 

More recent investigations b}/ the correlation method have in gen- 
eral substantiated Pearson’s findings. Murdock and Sullivan (62) re- 
port a correlation of .22 between head diameter (obtained by aver- 
aging maximum head wadth and maximum head length) and 10 
with about .^96 elementary and high school pupils. By the use of 
IQ’s and by the conversion of physical measurements into deviations 

'^Cephalic Index — , I ength of head is measured Irom I he space 

head length 

between the cycbiows to the lailhesi projeelion at the back of ihc head; head width, 
or breadth, js the distance from left to right sales, measured liom the points v)f 
maximal pi otrusion above each ear. I'he following is a common classification of 
cephalic index: 

Dolichocephalic, or long headed Cl below 75 

jVlesoccphahc. or mcdiiirn-headcd Cl between 75 and 80 

Brachyccphalic, or bioad-hcaded Cl above SO 

The number of cases differed slightly for each measure. 

® Found from a number of group intelligence tests. 
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from the average of each age-sex group, the influence of age was 
held constant. In a study of 449 medical students in Scotland, Reid 
and Mulligan (75) found a correlation of .08 between cranial capacity 
and scholastic achievement. Cranial capacity was calculated by taking 
the product of length, breadth, and width of the head, with allow- 
ance for thickness of different parts of the cranium. Scholastic 
achievement was determined by performance on standardized ex- 
aminations in three courses which were taken by all the students. 

Sommerville (92) obtained correlations of .10, .03, and .09 be- 
tween the scores of 100 male college students on the Thorndike 
Intelligence Examination for High wSchool Gradihitcs and measures 
of bead length, head width, and head height, respectively. The cor- 
relations were no higher between intelligence test scores and cranial 
capacity as estimated from the three given head dimensions. Employ- 
ing one standard fornnila for the computation of cranial capacity, 
'Sommerville found a correlation of .11 with intelligence test scores; 
with another formula, the correlation w-ais .10. These findings were 
closely confirmed in a more recent investigation by Broom (9). 
Cubic brain capacity, as estimated from external measures, gave in- 
significant correlations w'ith intelligence test scores in a group of 100 
college men and 100 college women. 

It thus seems to be quite conclusively established that no appre- 
ciable relationship exists between intellectual level and either cranial 
capacity or head shape as determined by the cephalic index. The 
correlations, although generally positive, are so low as to be of 
doubtful significance. SoiUv. dis'>cnting views are occasionally ex- 
pressed, even by mc^dern wiaters. advcKating the use o1 cranial meas- 
urement in the diagnosis of intcllc::tual development.* But their 
evidence is ambiguous and tlieir arguments are weak and inconsistent. 
Additional data on more detailed cranial conformation will be dis- 
cussed in a later section of the present chapter, in conjunction with 
facial measurements. 

In recent years, interest has shifted from fee gross characteristics 
of brain size and shape to less obvious conditions which arc more 
likely to influence brain functioning (47, 84). For example, the 
thickness of the cerebral cortex has been measured, and the concen- 
tration and distribution *// cells has been determined by taking sample 
counts in different sections. Fissurizatkm. or the nature and extent 

“ Cf„ eg., (73^ and, for a critique of the d.ita, PaterMin (65). 
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of “folds’* in the cerebral cortex, has been considered to be signifi- 
cant by some writers, since the brains of lower animals and immature 
organisms arc relatively smooth. Family similarities have also been 
noted in fissurization. Attempts to discover any correlation between 
behavior characteristics and any of these brain conditions, however, 
have so far met with consistent failure. Similarly, the chemical com- 
position of the brain, although undoubtedly an important factor in 
certain pathological cases, has as yet never been related to specific 
forms of behavior within the normal range of variation. The present 
state of knowledge regarding the relations between these various 
brain conditions and behavior has been aptly characterized by Lash- 
Icy. In a paper presented in 1947, he states (47, p. 326): “An at- 
tempt to relate phylogenetic and individual differences in behavior 
to brain structure is therefore rather an adventure in correlating the 
mysterious with the unknown/’ 

A relatively new and promising field of brain research centers* 
around the electroencephalogram (rHG), a record of the minute 
changes in electrical potential generated in the brain (16, 50). A 
special advantage of this technique is that it permits a study of brain 
function in the living organism. Since many important projterties of 
tissues are lost upon death, postmortem brain examinations may 
exclude essential facts. By means of electrodes attached to the scalp, 
the minute “brain waves,’* or ‘'fluctuations of electrical potential in 
the brain of the living person, arc picked up, magnified, and recorded 
graphically. It will be recalled (Ch. 5) that evidence of electrical 
activity in the cortex of the guinea pig was found during fetal life, 
although in the human brain no conclusive evidence of such activity 
has been found until some time after birth. Several types of rhythmic 
changes in electrical potential, dilTering in frequency and amplitude, 
have been identified in the adult human brain. Some of the most 
clear-cut results have been obtained with the alpha mives, which 
have an average frequency of about 10 per second and are found in 
normal children and adults during a relaxed waking state. Fairly 
consistent age differences have been observed in the frequency and 
amplitude of these alpha waves, as well as in the per cent of time 
that the alpha rhythm is present (49). Between the ages of 3 and 10, 
for example, there is a progressive increase in the frequency of the 
alpha rhythm. Sufficient data have been gathered to establish age 
norms in various aspects of the EEG (49, 50). Individual differences 
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in EEG have also been observed, the individual characteristics be- 
ing maintained with considerable consistency on successive retests 
(49, 50). 

Such findings have led certain investigators to in(|uirc whether the 
developmental changes in EEG arc related primarily to chronological 
age or to mental age. In studies on several l>pcs of feebleminded 
adults, Kreezer (44) reported a number of signilicant but gcncrallv 
low correlations between mental age and certain characteristics of 
the alpha waves. Several points should be noted, however, in inter- 
preting these results. First, the groups studied were usually small and 
many of the correlations were barely significant. Secondly, different 
characteristics of the alpha waves yielded significant correlations in 
diflerent types of feeblemindedness, suggesting that whatever rela- 
tionship exists is certainly not a simple one. Thirdly, the significant 
lelationships were conlincd to types of feeblemindedness having iHher 
clearly recognizable physical deficiencies. They were not substantiated 
in a group of “undiflercntiatcd” feebleminded cases vith no observ- 
able physical pathology. I his suggests that tiie disturbance in EEG 
may be associated with the other pathological physical conditions 
and need have no implications within the normal lange of variation. 
Other investigators have also failed to discover any significant rela- 
tionship between EEG characteristics and intellectual level among 
Linditlcrentiatcd feebleminded subjects (16, 49). 

What little direct evidence is available on normal subjects is also 
ncgati\e. In one study on noiraal children (43), for example, a sig- 
nificant correlation of .50 \ as lound between alpha frequency and 
IQ among 48 8-year-olds, but an insignificant correlation of .12 was 
obtained among 42 1 2-ycar-old>. f is possible that, among the 
younger children, individual diflercnc s in the level oi physical devel- 
opment within a single year of chronological age may account for the 
significant correlation. In a gioup of ! 100 aircrew candidates between 
the ages of 18 and 33, no relationship was found between intelligence 
lest score and alpha frequency (82). 

In the area of personality charactc i^dcs and emotional abnormali- 
ties, F^EG results arc scanty and incouciusivc, with the exception of 
the extensive body of data on epilepsy. It has been quite clearly 
established that epileptics show characteristic deviations in EEG, and 
that relatives of epileptics who have not themselves developed any 
of the clinical symptoms of epilepsy show similar disorders in EEG 
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(48, 50). It is also interesting to note that numerous case reports of 
children with behavior disorders show abnormalities of the EEC, 
some o[ them ot an cpileptoid form (50). We can probably attach 
little signilicance to the fact that several attempts to correlate per- 
sonality test scores and EEG characteristics in adults as well as chil- 
dren have yielded inconsistent and inconclusive results (32, 49). In 
such cases, the inadequacy of the personality tests as measures of 
behavior characteristics may have been partly responsible for the 
negative findings. 

PHYSIOGNOMY AND RELATED SYSTEMS 

There are many firmly entrenched popular beliefs regarding tlie 
“meaning’' of various facial and other bodily charaeteristics. The high 
forehead of the intellectual "high-brow,” the shifty gaze ()f deccitful- 
ness, the firm chin and square jaw of determination, the tapering 
fingers of the artist, and a host of other traditional associations which 
the reader can easily name have found their way not only into poetry 
and fiction but also into the snap judgments and “hunches” of every- 
day life. Similarly, we frequently hear of alleged personalii^' difler- 
ences between blondes, brunettes, and redheads, between blue-eyed 
and brown-eyed persons, or between those with a “convex” and those 
with a “concave” profile. Maity of these beliefs can be traced to 
ancient times. During the last quarter of the eighteenth century, a 
number of them were organized by Lavatcr into a system of char- 
acter analysis known as “pZ/yv/oguon/v.” Today, this system is about 
as popular among charlatans as is phrenology — and equally un- 
founded. 

A series of carefully controlled investigations designed to cheek 
many of the assertions of physiognomy w^erc conducted under the 
general direction of Hull (37). The relationship between camvexity 
of profile and several personality traits, which is often stressed by 
self-styled “experts” in physiognomy, was studied by Evans (19), 
The subjects were 25 college women, all of whom were members of 
the same sorority. Such a group was chosen because of their close 
acquaintance with each other and their consequent ability to rate 
each other with a fair degree of accuracy. For the same reason, all 
individuals who had not been members long enough to be well knowm 
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were excluded from the study. Each girl ranked the remaining 24 
in six personality traits, including optimism, activity, ambition, will 
power, domination, and popularity. The average or consensus rank 
of all 24 judges for each girl was computed as a final estimate of 
each trait. The subjects were also rated in a similar way for degree of 
blondness. A specially devised mechanical instrument was employed 
to read oil directly the “angle of convexity" of the profile. In order 
not to exclude any possibilities, convexity was measured in five dif- 
ferent ways, such as whole face, upper face only, convexity without 
including the nose, and so on. Height of forehead was also measured. 

1 he correlations between each of the measures of convexity of 
profile or height of forehead and cadi of the six personality traits 
were low and often inconsistent with expectation- - that is. a correla- 
tion which would have been expected to be negative on the basis 
of the physiognomists" claims was often positive, and vice versa. 
The highest correlations were a -[ .39 between ‘'convexity of whole 
face with nose omitted" and “activity" rank, and a —.39 between 
height of forehead and “will power" rank. Even these correlations, 
however, are not significant in view of the small number of subjects, 
and could have resulted from chance errors of sampling. The cor- 
relations for blondness ranged from ']-.28 with will pow'cr to —.26 
with optimism, 'rhese are also too low to be significant 

A further point to bear in mind in evaluating these correlations is 
that the existence of a widespread bias among the judges regarding 
the dissociation of facial and personality characteristics might in itself 
produce a correlation. Sine tesis were not available for the trails 
under consideration, it w^as necessary to resort to associates’ judg- 
ments; but this procedure is inconel jsive when widespread popular 
beliefs arc present. 

Facial and cranial ineasuremcnis were combined in a study by 
Sherman (83). A group of 78 freshmen in an engineering college 
were measured by means of a specially designed “radiometer." A 
total of 15 distances and 4 angles were found for each subject, and 
each of these measures was then cc rrelated with academic grades. 
The correlations with the combined grades on all courses tanged 
from —.26 to +.34. It is interesting to note that height of forehead 
correlated -.15 with academic grades. This corroborates the low 
negative correlations found by Evans between height of forehead and 
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several personality traits which might be expected to manifest them- 
selves in school work. If such a tendency were established, it would 
indicate a reversal of the popular notion of a “high-brow”! 

Similar studies have been conducted by a number of other investi- 
gators. One study had as its object to determine whether there is any 
relationship between the shape of the hand and a number of traits 
suggested by “chirognomists” (cf. 37, pp. 145-146). The results 
were clearly negative. Numerous experiments have been conducted 
to discover whether it is possible to judge intellectual or emotional 
traits from photographs, as might be expected if these traits were 
manifested in facial characteristics. All these investigations showed 
a lack of correspondence between the various physical characteristics 
and the behavior traits with which they were allegedly associated. 
There were cases, however, in which the judges agreed rather closely 
among themselves, a finding which suggests the prevalence of such 
popular stereotypes, or conventionalized physiognomic symbolism. 

It should be pointed out in conclusion that even when significant 
correlations are found between certain facial or cranial characteristics 
and psychological traits, as in the case of a few of vSherman’s meas- 
ures, the correlations arc still too low to give any informaAibn about 
individuals. They simply indicate a genera! trend in the group which 
may result from a few^ extreme cases. In so far as the correlation is 
far below 1.00, it shows that there are many individual exceptions 
to the general trend. Hie presence of these exceptions or reversals 
of relationship proves that whatever direct influence any such physical 
factor may exert upon behavioral development is very w'cak and can 
easily be obscured by other, more potent factors. 

A further point to note is that as long as a certain belief is widely 
prevalent regarding the association of a given physical characteristic 
with an intellectual or emotional trait, this may in itself influence the 
individuafs development. If a person is commonly mistrusted by his 
associates and is not given any responsibility, it is diflicult for him to 
be open and sincere. If a child is regarded as dull and stupid, he may 
easily come to believe it himself and act accordingly. Moreover, the 
school child whose appearance fits the popular stereotype of “stu- 
pidity” will probably receive poorer school grades than his “bright- 
looking” classmate whose actual achievement may be no better. 
The social and motivational influence of a widespread prejudice can- 
not be ignored. A vicious circle is initiated by such a situation; the 
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more widespread the prejudice, the moie effective it will be and the 
more evidence can therefore be found which seems to support it. 

From these considerations it is apparent that any relationship 
which may exist between facial characteristics and psychological traits 
cannot be large. Even the slight correlation occasionally found is far 
from conclusively established because of many remaining uncon- 
trolled factors. Sliould a slight correspondence be proved between 
certain facial or cranial conformations and behavior, such an asso- 
ciation could result from a common dependence of both types of 
characteristics upon the same underlying condition. The activity of 
the endocrine glands ofiers possibilities for such a connection. In cer- 
tain extreme pathological cases as, for example, thyroid deficiency, 
the resulting condition includes typical physical as well as mental 
symptoms. It is barely possible that certain facial charactei istics, as 
well as emotional or intellectual traits, are intluenccd within their 
normal range of variation by over- or underaetbity of some endocrine 
gland. This, ol course, is only speculation. The field of endocrinology 
is far too complex and too young to offer any clear-cut answers to 
such a query. 

BODILY DIMLNSTONS 

Gross bodily dimensions, proportion of trunk and limbs, height in 
relation to weight, and similar structural characteristics have also 
been suggested as possible indices ol intellectual or emotional status. 
Since much of the materia' in this field has been collected to test 
out the various 'hype theories’' proposed from time to time, the dis- 
cussion in this section will be supplemented in the following chapter. 
Only the data on gross size and ab^^olute measures will be treated 
here, the material on relative proportions and body type being re- 
served for Chapter 13. 

Similarly, wc arc not concerned with gross malformations and 
pathological conditions. Many of these conditions, familiar to anyone 
who has seen circus “freaks,*’ have bejn definitely traced to glandular 
disorders. Thus gigantism, a conditior in which the individual may 
attain a height of seven or eight feet,^ results from oversccretion of 
a pituitary hormone. Dwarfism, or stunted growth with normal bodily 

‘'The “p.int'’ with Ihe Rinfhnfi Bios. Bainuin & Bailcv ciicus was reported lo be 
8 feet, inches tall 
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proportions, is produced by insufficient pituitary secretion. No definite 
intellectual defect has been demonstrated in these cases. Cretinism, 
associated with an underactive thyroid, is characterized by abnormal 
bodily development and proportions as well as by intellectual defect, 
sluggishness, and other behavioral disturbances. If we exclude cases 
which manifest obvious glandular dysfunctions or other pathological 
conditions, we still find a wide range in height and weight within the 
general population. It is into the relationships of these variations 
with behavioral characteristics that we now wish to inquire. 

As in the case of cranial measurements, interest in body build 
has long been manifested. The search for a possible relation between 
body dimensions and intellect probably received a strong impetus 
from the popular view that the intellectually gifted were deficient in 
other respects. In particular, it was maintained that such individuals 
were weak, puny, and physically inferior. This notion of compensa- 
tion was cherished widely because of its consoling character — it was 
no doubt accepted as the device of a benevolent nature to “'even 
things up.’’ In the efl'ort to overthrow these unfounded beliefs, early 
research workers swung to the opposite extreme and asserted that 
the intellectually ablest were also the physically ablest and that a 
close correspondence exists between physique and mental ability. 

Galton (22), for example, maintained that the number of 
physically superior individuaJjj among his groups of eminent men 
(ef. Ch. 10) was greater than in the general population. Many 
studies on large groups of children have subsequently appeared which 
relied upon the comparison of averages for their conclusions,-^ Such 
in\estigations agree in finding a slightly higher average height and 
weight among the intellectually superior groups than among the 
normal, and slightly higher among the normal than among the dull. 
Intelligence was usually estimated quite crudely from schtx^l progress 
or teachers’ ratings. The differences in averages were always so slight 
and the overlapping of groups so large that the degree of correlation 
between height or weight and intelligence would necessarily be 
negligible. 

Investigations on the physical status of the feebleminded or the 
intellectually gifted chiid have yielded results which are equally dif- 
ficult to interpret. When averages are compared, the feebleminded 
appear to be definitely below the norms in height and weight, and the 

For a siimmaiy of tht* early lilcraiure, see Paterson (6S). 
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bright children above the norms. In Terman’s extensive investigation 
(98) on gifted children, a slight tendency was noted for the subjects 
to be above the age norms for American-born children in height and 
weight. L. S. Hollingworth (35) compared the heights of three 
groups, each composed of 45 children between the ages of 9 and 1 1. 
In the “superior” group were only children whose lO's were above 
135 (median 10 - 151 ); in the “normal, “ those with lO’s between 
90 and 110 (median 10 - 100); and in the “inferior,” those with 
lO’s below 65 (median 10 43). 'I'hc subjects in the three groups 

were carefully equated, e^ich child in the one group being “matched” 
with a child in the other two groups in respect to age, sex, and racial 
background, so that the inlluencc of these factors was ruled out. In 
I’ablc 17 will be found a frequency distribution showing the number 
of children in each group who fell within successive class-intervals 
in height, as well as the average height of each group. 


TABLE 17 Disirihuuons and Averages of Height in Intidlcctually 
Superior, Nortnal, and Inferior Cirunps 
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(Median IQ ■ 100) 

(^irt>'-ip ( -' 

(Median IQ 43 1 

SS 59 

12 


1 

50 54 

30 

30 

18 

45-49 

3 

13 

23 

40-44 

0 

0 

3 

Average 

heiglit 

52.9 

51.2 

49.6 


Norsworth} (6^), in an early but comprehensive survey of the 
characteristics oj the leebleniincleci, obtaineii measures of height and 
weieht on 157 mental defectives in special cia.ses and in various in- 
stitutions. She found the same slight ditfercnccs in averages, with 
marked overlapping, 44% of the mcmally defective children exceed- 
ing the median of normal children in weight and 45% in height.'^ 

'"Cf. Ch,:alcr 17 foi fuller repot t. , , . r it ^ , 

■' Coiiiplele oreiloppiiiR woiilil have been indicated if SO'c of the feebleminded 
group had exceeded Uie noimal median. 
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Fig. 68. Average Height and Weight of Feebleminded and Normal Boys 
at Successive Ages. (Horn Ooddaid, 24, p. 228 ) 
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Goddard (24) collected extensive data on the height and weight of 
about 11,000 mentally defective individuals, ranging in age from 
early infancy to 60 years, in 19 American institutions for the feeble- 
minded. In Figures 68 and 69 arc reproduced curves showing the 
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Age .n Years 


Fig. 69. Average Height and Weight of Feebleminded and Normal Girls 
at Successive Ages. (From Goddard, 24, p. 229.) 



388 Differential Psychology 


average height and weight of successive age groups within four intel- 
lectual levels; the data on boys arc given in Figure 68, those on girls 
in Figure 69. It will be noted that the curves of the four intelligence 
groups exhibit a slight but consistent tendency for physical inferiority 
to parallel intellectual inferiority. The relationship is clearer during 
the adolescent and adult years than it is at earlier years. 

Several factors may enter in to complicate the analysis of insti- 
tutional data on the feebleminded. In the first place, those individuals 
with physical as well as intellectual defects arc more likely to be 
committed to an institution. The feebleminded person who is physi- 
cally fit or superior is less likely to be sent to an institution at all 
iind more likely to leave the institution after he has received several 
years of training. Such individuals will have a greater chance to suc- 
ceed in a routine occupation rc(|uiring strength and a good physique, 
with a mininuirn of thought and planning. The operation of sucli a 
selective factor might explain the divergence of Goddard's height and 
weight curves with age. Since only institutional cases were tested, 
the inferiority at the upper ages could have resulted from l!ic fact 
that the physically strongest and ablest had left the institution. In 
addition, the norms in terms of which these groups are evaluated 
may not be comparable at successive ages. Such norms are iisiialiy 
established on school children because of llic latter's ready accessi- 
bility lor mcasuicment. Vhe norms at liigher ages are frequently de- 
rived from high school students, <i distinctly select group in respect 
to the general population, f inally, in a survey extending down to 
low-grade feebleminded levels, it is likely that several cases present- 
ing special conditions such as cretinism arc included; this would 
further lower the average physical measurements of the feebleminded 
group. 

A more direct answer is provided by studies which employ the 
correlation technique with norma! adult groups. In such studies, the 
complicating intluenees of age and of special pathological conditions 
are avoided. Moreover, the correlation ce^elficient provides an index 
of the degree to vviiich a relationship between physique and inle'li- 
gence exists among all individuals in the group. Average dilTerences 
between groups, especially when slight, may mean little if the over- 
lapping between groups is large. Brooks (8), employing 1118 junior 
high school, normal school, and college students between the ages of 
13 and 20, correlated measures of height and weight with perform- 
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ance on several standardized group intelligence tests. Since correla- 
tions were computed separately for the two sexes and for several 
"fcC groups, the subjects were classified into 1 7 groups ranging in 
number of cases irom 16 to 139. The height correlations ranged 
from -.09 to -f .26; those for weight ranged Trom —.31 to +.26. 
In the study by Sommervillc (92) on college freshincn, described in 
an oarlicr section,^- a correlation of .16 was found between intelh- 
genee and standing height, .13 between intelligence and sitting height, 
and .10 between intelligence and weight. The majority of these cor- 
relations are positive but so low as to indicate little or no appreciable 
relationship between general bodily size and intellectual level, when 
selective factors and other irrelevant conditions are ruled out.^'^ 


PilYSIOlXXlICAt. CONDITIONS 

Efforts to discover what association, if any, exists between various 
physiological conditions and behavior characteristics have followed 
two principal a.pproaehcs. fhe first is a comparison of the relative 
irequenry of a particular physiological condition in groups or indi- 
viduals dilfering in known intellectual or personality characteristics. 

1 he procedure can, of course, be reversed by comparing tlie psycho- 
logical characteristics of individuals chosen on the basis of physical 
condition. The second and somewhat more direct procedure is to 
sec wl’iat arc the psychological eifects of treatment for a given physio- 
logic \l condition. Such “before-and- after'’ studies, wiien feasible, pro- 
vide a more ciear-cul ana!ys*s of causal relationships. The number 
of physiological factors which could be considered in relation to 
beliavior is almost without limit. We ^aall discuss a few typical illus- 
trations which arc of more general interest. 

General Health. Is genera! hciilth related to intellectual level? 
Are miscelianeoLLs plivsiologieal defects such as enlarged glands, de- 
fective breathing, dental caries, diseased tonsils, and other common 
health disorders any more prevalent among dull than among briglu 
persons in the general population? j. number of extensive school 
surveys on this tiuestion have been conducted from time to time, 
using either school achiewmieni or intelligence test scores as indices 

^“Cf. sect it n nr\ C:\iniiil and Ceicbial Mca'-im-nic ns. 

ConolaliOMs loiirul anioiij.: clvlvlicn will be d-scus'scd in the sirtion on Develop^ 
menial Rciabonships. 
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of ability level (2, 17, 41, 57, 78, 94, 100). Some have yielded nega- 
tive or ambiguous results. Among those showing some evidence for 
a relationship between frequency of defects and intelligence is the 
extensive study conducted by the U. S. Public Health Service on about 
4500 school children in two counties in Illinois (41). The most 
relevant results of this survey are summarized in Table 18, which 
shows the relative frequency of each defect among children whose 
IQ’s were under 90 and among those with IQ’s of 110 or higher. 
Each entry in this table is a ratio of the number of defects in the 
designated IQ group to the number found within the normal IQ 

TABLE 18 Relative Frequency of Physical Defects among Bright 
and Dull School Children 


(AtKiptc'l from Ktru'jf and ('olhiis, 41, p. 1772) 


Type of Defect 

Ftequenev 

JQ under ^>0 

of Defect 

10 1 10 or over 

One or more decayed teeth 

111 

82 

Gingivitis 

152 

91 ♦ 

Defective tonsils 

J16 

86 

Adenoids 

123 

68 

Other nasal obstructions 

119 

89 

Enlarged glands: 



(1) Antciior cervical 

317 

81 

(2) Posterior cervical 

99 

79 

(3) SubmaKillaiy 

135 

61 

(4J Thyroid 

114 

94 

Defeciivc liearing (voice test) 

157 

72 

Otitis media 

147 

45 

1 Defective eardrum 

174 

69 

Mastoidectomy (scar) 

68 

222 

Defective vision (Snellen test) 

128 

90 

Conjunctivitis 

325 

135 

Strabismus 

175 

109 

Speech defects 

143 

58 

All heart defects 

111 

75 

Nutrition: poor or very poor 

106 

81 

Posture: poor or very poor 

156 

120 

Fingernail biting 

87 

96 

Evidences of rickets 

310 

115 

Scoliosis 

106 

110 

High arched palate 

127 

n 

Marked dental malocclusion 

136 

89 

All skin diseases 

178 

136 
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range (90 -100), the latter being taken as a standard. The first entry 
in the table, for example, means that the number of children with 
decayed teeth in the dull, normal, and bright groups v/as in the ratio 
of 111: 100: 82. The number in the middle group is always taken 
as 100, and the number in each of the other two groups is expressed 
as a ratio to 100. With only a few exceptions, there is a consistent 
tendency for each defect to be most common in the low 10 group 
and least common in the high IQ group. The investigators also report 
that the average number of dilfcrent defects per child showed the 
same general relationship to IQ. I’he relationship, however, appears 
to be a general rather than a specific one. It is not any particulaj' 
type of defect, but rather the presence of defects as such, which 
differentiates the high and low TQ groups. 

Such group trends should be interpreted with considerable caution 
because of the many mdividual exceptions. A sizable proportion of 
dull children who are completely free from physical defects can be 
found in each survey; bright children with many defects can likewise 
be found. Correlations between intellectual or scholastic level and 
physical conditions have been uniformly low (cf. 65). 

Cicneral health, as rated by a physician, also shows little or no 
appreciable relationship to intelligence within a typical sampling of 
school children. In one study (34) on 343 third and fourth grade 
American school children, those rated ‘‘good” in health had an aver- 
age IQ of 104; those rated “poor” averaged 101. Moreover, those 
children whose health rating improved from the initial to the final 
e\amination showed no more gain in MA than those whose health be- 
came worse. Among intellectually backward children, oil the other 
hand, the relationship between poor h^ dth or physical defects and IQ 
is much closer. In a survey of 14,3^/ 9 retarded school children in 
Massachusetts, large and significant differences in average IQ were 
found between those having various physical defects and those free 
from defect (17). The mean IQ of the entire group, however, was 
70.7, and over 500 children bad IQ’s below 50. The inclusion of 
feebleminded cases may account for tl relationship in such a group. 

Local Infections. It has sometimes been argued that any focal 
infection in the body may, by releasing toxins into the blood stream, 
affect the functioning of the entire organism. Because of its super- 
ficial plausibility, such a thec^ry meets with ready popular acceptance. 
At times it has led to exaggerated claims regarding the psychological 
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as well as physical improvement to be expected from the treatment 
of such conditions as diseased tonsils or infected teeth. The fact of 
the matter is, however, that no significant effects of any such con- 
ditions upon behavior characteristics have been demonstrated when 
proper experimental controls were observed. Most bodily mechanisms 
apparently have sufficient protection against any general effects which 
these local disorders may induce. 

Among the extravagant assertions which have attracted popular 
attention from time to time is the claim tha" dental caries (decayed 
teeth) will interfere with a child’s intellectual development. The evi- 
dence cited in support of such claims has never stood up under critical 
analysis. What few dependable data arc available on this question 
indicate virtually no relationship between dental caries and intellec- 
tual level. Correlations between dental condition and intelligence 
have proved to be uniformly low and negligible (cf. 65, Ch. 6). 
Similarly, extensive dental treatment and prolonged training in oral 
h}gicnc were not accompanied by any greater gain in mental test 
performance than was loimd in a control group which did not receive 
these benefits (42 ). 

Probably because of their relatively frequent occurreneJ among 
children of school age, diseased tonsils have also received their share 
of attention as a possible cause of intellectual backwardness. In the 
effort to test these claims, a carefully controlled investigation was 
carried out with 530 public school boys between the ages of 6 and 
14 (77). All had been given the Stanford-Binet as a part of the 
regular school routine. On the basis of an examination by the school 
nurse or physician, the children were classified into two groups, the 
one composed of 236 boys whose tonsils were sufficiently diseased 
to require treatment, and the other of 294 boys whose tonsils were 
either not defective or so slightly defective as not to call for treat- 
ment. The average IQ's of the normal and defective groups proved to 
be 95.4 and 94.9, respectively. The percentage distributions of the 
two groups arc given in Figure 70. The practically complete over- 
lapping of these groups is apparent from an examination of \hc 
distribution curves. 

As a further check on any possible influence of tonsillar condition 
upon mental development, 28 boys whose tonsils were subsequently 
removed were retested with the Stanford-Binet after a six months’ in- 
terval. The gain in IQ made by this group was compared with that 
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of a control group of 28 boys who suffered from diseased tonsils 
but who had not been operated upon. The operated group made an 
average gain of 2.25 10 points, as compared with an average gain 
of 3.28 in the control group. Finally, it was possible to test 21 sub- 
jects after an interval of from 10 to 17 months following their opera- 
ation. The average gain in IQ made by this group was 3.0 points. 



!• Jligence Quotient 

Fig. 70. Percentage Distiibution ol IQ's of Boys with Normal and with 
Diseased Tonsils. (DaU from Rogers, 7'^ p. 29.) 

while a control group of 3 [ cases gained 6.2 points. It is very doubtful 
whether further retests after a longer delay would reveal any effect 
of the tonsillectomy on intellectual development. The results of the 
two types of procedure followed in tliis study arc thus mutually 
corroborative in demonstrating a lack of relation.ship between intel- 
lectual level and diseased tonsils. Not only w^as there no significant 
difTercnce between the initial lO’s of normal and diseased groups, but 
also removal of the diseased tonsils produced no improvement in 
IQ which could be attributed to such treatment. These findings have 
been corroborated by more recent studies (51, 76). 
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Another popular belief is that hookworm infection produces men- 
tal defect, sluggishness, and apathy. Because of its prevalence among 
school children in certain parts of the country, this infection has 
attracted the serious notice of educators. Several studies have indi- 
cated the tendency for children with hookworm infection to be duller 
than those not so afflicted. In one of the most careful of these investi- 
gations (90), the Otis Intelligence Test was administered to 118 
children in grades 3 to 7 of three rural schools located in a typical 
‘'hookworm area.” Through medical tests, the degree of hookworm 
infestation was also determined for each child. Below are given the 
average IQ's of children in five categories, ranging from a normal 
group, which showed no trace of hookworm infection, to the most 
heavily infected group (from 90, p. 319). 


Intensity of Infection Number of Cases Ayerage IQ 

Normal 17 90.2 

Very light (!-25)-*= 40 88.3 

Light (26-100) 27 86.4 

Moiicrute (lOI-SOOl 23 84.1 

Heavy (501-2000) 10 76.3 

* Estimated nutaber of hookwoims. 


Although the differences in averages arc appreciable if extreme groups 
are compared, the overlapping of all groups is large. When individual 
scores rather than group averages are considered, a correlation of 
.30 is obtained between IQ and degree of hookworm infestation. 

This correlation, although not high, indicates a sornew^hat closer 
degree of relationship than has been found between mental level and 
any of the other physiological conditions so far discussed. The analysis 
of results obtained in investigations on hookworm suggests the opera- 
tion of a factor which is probably present, although to a lesser extent, 
in all studies on the relationship between psychological and physical 
characteristics. The individuals of inferior physical condition in gen- 
eral tend to come from a poorer socio-economic levels their environ- 
ment is deficient in opportunity for intellectual development as well 
as in sanitary conditions, facilities for medical attention, proper food 
and home care, and so forth. This is particularly well illustrated by 
hookworm, a condition which is relatively common among individuals 
of low social status and which flourishes in very poor and backward 


^'‘Computed by Paterson (65, p. 1%), from the data of Smillie and Spenecr. 
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rural districts. The environmental background may be the common 
underlying factor which leads both to the physical and to the psycho- 
logical conditions. This could in itself account for what little relation- 
ship is found between physical condition and IQ. 

Glandular Conditions J'* It is well known that marked over- 
activity or undcractivity of any of the endocrine glands may have a 
pronounced clTcct upon behavior. Within the range of normal varia- 
tion, however, no significant relationship between glandular funciiom 
ing and intellectual or emotional characteristics has been conclusively 
demonstrated (cf. 86). Among the most readily obtained indices of 
glandular activity is the familiar basal mcmholic rate (BMR). This 
is a measure of the rate at which the body uses oxygen, which in 
turn depends upon the degree of activity of the thyroid gland. An 
abnormally low BMR can be raised by the administration of thyroid 
extract. Extreme under activity of the thyroid results in cretinism, a 
condition characterized by feeblemindedness as well as by a number 
of clearly recognizable physical symptoms The milder variations in 
BMR among normal adults or adolcscentsd^* on the other hand, have 
consistently shown negligible or zero cor»*clations with intelligence 
test scores in a number of investigations (cf. 86, p. 604). There is 
some evidence suggesting a significant relationship between BMR 
and scholastic achievement among college students, a relationship 
which may be atlributahie to individual dilTcrcnccs in general energy 
level (58 ). 

A possible association between mild glandular abnormalities and 
personality disorders is suggo. .ed by the relatively large incidence of 
glandular disorders among “problem children.'” In one survey (54) 
of 1000 children who were classifice: as behavior problems, 20% 
showed sonic glandular defect. In 10%, the glandular condition 
seemed to be a causal factor in the bchiivior disorder. That the rela- 
tionship may not be so direct as these data imply is suggested Dy the 
variety ot behavior disorders .vhich are associated with the same type 
of glandular disorder. Conversely, the same kind of behavior disorder 
is found in children with entirely dilferv nf <r]andular defects. As is true 
of many physical conditions, the ivlationship with i^chavior is general 
and not specific. A plausible hypothesis to account for the observed 

'"’For a summajy of ibe data pertaining to the effects of endocrine secretions 
upon hehaviot jii man and animaK. el Ucach (a). 

^‘'For data on HMK in chiJdien. see the section on Developmental Relationships 
in a later part of the present chapter. 
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association between glandular disorders and behavior problems is 
based upon indirect social effects of the abnormal physical condition. 
If the glandular defect handicaps the child or renders him in any way 
different from other children, the behavior problem may simply 
represent the child’s reaction to this abnormal situation. 

“The Internal Environment.” The physical and chemical con- 
dition of the blood, which constitutes the internal environment of the 
organism, is of prime importance in the normal functioning of the 
individual and in the maintenance of life hself. A large number of 
investigations have demonstrated pronounced behavior symptoms fol- 
lowing changes in such conditions as the temperature, oxygen content, 
sugar content, or acid-base brdance of the blood (4, 16, S6). One 
illustration is to be found in the well-known eflects of oxygen depriva- 
tion — as in high altitudes — which include conspicuous alterations of 
sensory, motor, intellectual, and emotional responses (56, 60). There 
is evidence that some of the blood conditions wliich produce tempo- 
rary disturbances of corebial functioning may lead to irreversible 
changes in the brain cells and thus effect permanent behavior modi- 
fications in the individiiah Especially significant arc agents of tliis 
sort operating in early childhood or during prenatal life. Scvfre anoxia 
(oxygen lack) at birth, lor example, may produce brain damage lead- 
ing to motor, intellectual, or emotional disorders throughout life 
(cf. 86). Although some of these blood conditions may themselves 
be genetically determined, others probably depend upon charaettu'- 
isiics of the individiiaf's previous environment. These investigations 
thus suggest an('>lher wav whereby environmental factors may influ- 
ence the individiiafs subsequent behavior development. 

A somewhat dilTcrent question is wliclher individual differences in 
blood chemistry among norma! adults are in any wiiy related to 
behavior difiercnces. It should be noted in this connection that the 
body has a number of regulatory mcehanisms which preserve the 
stability of the internal environment within very narrow limits. T he 
maintenance of this relatively stable slate has been tcimcd ''hoinco- 
stasis.'' One of the important regulatory mechanisms is provided by 
the action of various endocrine glands, which counteract chemical 
deficits or excesses in the blood composition. Owing to such internal 
safeguards, the composition of the blood docs not vary widely among 
individuals or within the same individual under ordinaty conditions. 
Despite this fact, hypotheses regarding the relationship between indi- 
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vidual dilTerenccs in blood composition and intellectual or personality 
traits have been plentiful. The study of the behavior correlates of 
blood chemistry is today an active field of research, but so far the 
data have been contradictory and disappointing. One of the most 
widely discussed of these possible relationships is that between emo- 
tional stability and homeostasis. There is some evidence (25) which 
suggests that the more neunUic individuals Lend to exhibit grcatci 
daily fluctuations in blood composition than the better-adjusted 
individuals."" At best, however, the results on blood chemistry and 
behavior provide interesting leads for future research. 

A number of recent investigations have been concerned with “auto- 
nomic balance,"' by which is meant the interaction between the sympa- 
thetic and parasympathetic branches of the autonomic nervous system. 
Although tliis research is still in an exploratory stage, there is some 
evidence suggesting a ] possible relationsliip between physiological in- 
dices of autonomic balance and emotional and social characteristics 
in children (99), 

Psychosomatic Disorders. A number of cimdi lions such as 
asthma, skin allcrcies, and gastric ulcers ha^c in recent years attracted 
considerable attention because of their possible 
origins. 'I his simply means that psychological factors may serve as 
contributing and in some cases even determining conditions in the 
development of these physical disorders. Many descriptions of the 
so-called iilccr-typc personality haw appeared, although most of 
these descriptions arc based upon the general impressions of clini- 
cians rather than upon contix- led observations (cf. 79). So common 
is the belief that worry, tension, and excessive drive arc associated 
with stomach ulcers that this condition has sometimes been described 
as “Wall Street slomacli.” Less widcl\ known are the thories pro- 
posed regarding other psychosomatic disorders. Some observers have 
suggested, for example, that allergic children are more intelligent or 
more dimiinant in scx'ial relations than non-allcrgic children (cf. 86). 
Other studies, however, have failed 1o corroborate any of these 
claims regarding allergies (80). 

Whether significant relationships bctw'ecn intcllccJual or personality 
characteristics and any of thcNC “psychosomatic” conditions wall be 
found when more and better investigations are conducted remains to 

‘■A fun her discussion of hrmcosUisis will be found in a la.er chapicr on sex 
difTerences (Ch. 19). 
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be seen. Should any such association be established, however, its 
interpretation would still present difficulties. Does the physiological 
condition lead to the behavior manifestations, or vice versa? Are both 
the result of certain environmental factors, such as occupation or socio- 
economic level? Are the psychological effects in part an indirect 
consequence of the social handicap occasioned by the physical con- 
dition? Any one of these relationships could theoretically hold. It is 
probable that all are involved to some extent. 

NUTRITIONAL FACTORS 

The serious food shortages in many countries following World War II 
have given special impetus to the study of the effects of malnutrition, 
and have made this a topic of major social concern. The rapid 
growth of the young science of nutrition and the extensive research 
on vitamins have also served to focus attention upon the amount 
and nature of food intake by the body. Apart from well-established 
physical effects, are there psychological effects of diet? Certainly no 
shred of evidence has been presented to support any claims of the 
diet faddists. That fish is “brain food" and meat makes a person 
more irritable and aggressive arc examples of old wives’ tales and 
no more. More worthy of serious study is the possibility that gc*heral 
malnutrition may have a deleterious effect upon intelligence. All sur- 
veys conducted within the normal range of cither nutritional status 
or intelligence have shown only a -slight positive correlation between 
these two variables (21, 38, 86, 87). F^en this correlation tends to 
disappear when comparisons are made within a relatively homoge- 
neous social group. The influence of socio-cc(moinic level upon these 
correlations is probably similar to that discussed earlier in connection 
with hookworm: the brighter children tend to come from better 
homes, which also provide more adequate diet. 

Studies in which undernourished school children were given special 
diets for a period of several months and brought to a normal physical 
condition have yielded inconsistent results with respect to intellectual 
improvement. It is likely that when positive results arc reported in 
these studies an uncontrolled motivational factor may have operated. 
For example, in one investigation (81), 50 underprivileged children 
were separated into two equated groups, one of which was served 
a special daily breakfast at school throughout the experimental period, 
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while the other was not. The performance of the experimental group 
improved more than that of the control group in both school work and 
standardized tests, but the difference diminished gradually after the 
breakfasts were discontinued. In this situation, the motivational effect 
of the special attention shown the experimental group could account 
for the entire difference in periormance. The results arc therefore 
inconclusive with respect to nutrition. 

There is some evidence that nutritional status may be more closely 
related to intellectual performance among mental defectives than 
among persons of higher intcilcctual level. One investigator (45, 74) 
tested 41 feebleminded children, aged 2 to 7 years, who were under- 
nourished at the time of llie first test and well nourished at the time 
of the second test. The control group consisted of 41 uniformly 
well-nourished children, matched as closely as possible with the ex- 
perimental group in chronological age, 10, and interval between the 
two tests. Following their improved nutritional status, the experi- 
mental group gained an average of approximately 10 10 points, 
while the control group showed no change during the same period. 

It required from 18 to 24 months of the dietary regimen to bring 
about the improvement in IQ. The age of the subject also affected 
the results, the greatest gains being made by children under 5. 

It is also likely that when the diet is close to the subsistence margin 
a closer relationship may exist between nutritional level and intelli- 
gence (cf. 39). Among the most common effects of severe malnutri- 
tion are fatiguability, lack of energy, and lassitude. If prolonged^^..'' 
these conditions would themselves interfere with learning and thijs 
retard intellectual development, even if no other effects of jbal- 
nutrition on behavior arc to t^e found. Similarly, malnutrition in 
combination with other poor health conditions may constitute a suf- 
ficiently serious handicap to interfere wiM normal behavior develop- 
ment in young children. In one investigation, intelligence tests were 
administered to children undergoing outpatient treatment for varioi:is 
disorders cither in a clinic or in a private physician’s office (45). 
Within this group, 50 who wcie classified as undernourished at the 
time of the first examination showed a significant rise in IQ after 
their nutritional status was brought up normal. A control group 
of 50 well-nourished children who w^cre also outpatients at the same 
centers showed no significant 10 change over a similar period of time. 
Since the subjects in this investigation were all patients, it is likely 
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that many of the malnourished children were either severely under- 
nourished or were siifTering from other health difficulties which made 
their total physical handicap more serious. 

Nutrition research has demonstrated that the qualitative aspects of 
diet are even more important than the quantitative. Animal experi- 
ments, as well as clinical observations on humans, have furnished 
ample evidence that serious physical disorders may result from the 
lack of one or more essential vitamins froi?i the diet. A lively area 
of current research is concerned with the psychological elTccts of 
vitamin deficiencies. Because of the known physiological eflects of 
vitamins of the B-group upon the nervous system, most of the be- 
havior studies have concentrated on this group. I'here seems to be 
good evidence that a deficiency in B-\iLamins reduces physical 
strength and endurance (7, 13, 86). Clinical reports (cf. 86) on 
patients with vitamin B deficiency have consistently mentioned irrita- 
bility, moodiness, and lack of coc'jperation. In cases of more severe 
deficiency, apathy, depression, and emotional instability arc observed. 
Relatively few well-controlled experimental studies on the effect of 
vitamin deficiencies upon human behavior arc available, t^id most 
of the investigations have dealt uith too few cases to be concluMve. 
in general, these studies show no diminution of intellectual functions, 
nut only motor and personality changes (7, 27, 86). Nor has the 
administration of excess vitamin B to normal individuals shown any 
consistent effects on behavior. 

On the other hand, there is some evidence to suggest that the 
Continued administration of tJuauiin to children whose diet has been 
sorriowhat deficient in vitamins may lead to significant improvements 
in cer tain behavior functions. A well-controlled experiment (28) was 
conducted on matched pairs of orphanage children whose normal 
diety^vas relatively low in vitamin content. One member of each pair 
tc^cived regular thiamin pills, while the other received a placebo, 
oy '‘bread pill,” as a control. The procedure was such that neither the 
^nildrcn nor any member of the orphanage staff knew which were 
‘ihc experimental and which the control children. Follow-ups over 
a two-year period showed a significant difference in favor of the 
thiamin-fed group in such tests as visual acuity, rote memory, and 
code-learning. The nature of the tests suggests that the advantage of 


One of the B-tomplex vitamins. 
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the thiamin-fed group may have resulted largely from greater alert- 
ness and better ability to concentrate. 

A few investigations have been concerned with the elTects of 
glutamic acid^^ upon psychological functioning. One study (104) 
reported a significant increase in 10 among 44 mentally retarded 
children following the administration of glutamic acid for six months. 
In another, 60 mental defectives were tested before glutamic acid 
therapy, as well as four and eight months following the beginning 
of treatment (23). As a control measure, part of the group received 
glutamic acid and part received a placebo at the start, the procedure 
being reversed foi some of the subiects after four months. Preliminary 
results indicated a small hut significant rise in IQ following the glu- 
tamic acid therapy, while no improvement followed the administra- 
tion of the placebo. Th<. permanence of the improvement noted in 
such investigations cannot, of course, be determined without more 
cxtensi\e iollow-ups. Moreover, it is possible that the observed intel- 
lectual improvement may have resulted largely from an increase in 
alertness following the glutamic acid therapy 

One of the few intensive and w ell-con ti oiled experimental studies 
of the effects of nutrition upon human behavior is that conducted at 
the Laboiatory ol Physiological Hygiene of the l^niversity of Min- 
nesota (11, 12, 26). dhirty-four men between the ages of 21 and 
33, who volunteered for the experiment, weic kept for six months 
on a semi-starvation diet described as characteristic of huropean 
famine conditions. As a stand^id, i.ach subjeefs normal performance 
during a three-month period of adequate diet was recorded. 'The 
daily intake of calories averaged 3150 during the preliminary normal 
period and 1755 during the second, or experimental, period. The 
average weight loss of the subjects during the semi-starvation period 
amounted to about 25%. 

The clearest behavior change during the experimental period was 
a decline in strength and endurance in motor tasks, and a less marked 
but significant loss in motor speed and coordination. No sensory 
effects were noted except a rise in auditory acuity and an increased 
sensitivity to cold. On a scries of tests of intellectual functions, no 
change in cither speed or level of performance was observed; nor 
was learning affected. In contrast to this lack of impairment as dctcr- 


‘"^Cjlutamc acid is one of Ihe essential ammo acids derived from proteins. 
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mined by objective tests, the subjects believed that they had deterio- 
rated sharply. Self-ratings iii alertness, concentration, comprehension, 
and judgment dropped markedly in the course of the experimental 
period. These differences are probably related to the personality 
changes, which were conspicuous. Personality tests (Minnesota Multi- 
phasic and Guilford-Martin Inventory) showed a statistically sigmli- 
cant increase in depression, hysteria, hypochondria, nervous sym- 
toms, feelings of inadequacy and inferiority, and introversion; a 
decrease in general activity and in social Icadciship was likewise 
indicated. Constriction of interests and obsessive preoccupation with 
thoughts of food were very apparent. 

The third stage in the experiment consisted of a l2-weck con- 
trolled nutritional rehabilitation period, in which the caloric intake 
was increased b> different amounts in different sub-groups. The ex- 
perimental design also included groupings in which the diet was 
supplemented with protein or with vitamins. The weight rises during 
this period varied with the amount of caloric intake, but were not 
significantly related to vitamin or protein supplementation. The effects 
on motor, sensoiy, intellectual, and personality functions parTilleled, 
in reverse, the previous changes during semi-starvation. The improve- 
ment was large in motor functions and in personality characteristics, 
but no significant change was found in intellectual functions. Vitamin 
and protein supplementation produced no significant differences in be- 
havior recovery, although the amount of caloric intake did. This 
experiment indicates that the behavior changes induced by semi- 
starvation are reversible and remediable. Jt should be remembered, 
however, that such a finding applies to a six-month period of inade- 
quate nutrition in adults. What would occur in a child, or following 
a longer privation period, we cannot infer. 

DI Vi.LOPMlNlAL R1 1 AnONSHIPS 

In an earlier chapter (Ch. 5), we discussed the structural changes 
which parallel changes in behavior in the growing organism. The 
study of such changes represents a developmental rather than a 
statistical approach to behavior differences, and has bcci limited 
principally to prenatal or early postnatal stages of development. T he 
studies which have been considered in the present chapter, on the 
other hand, have been concerned with correlations between struc- 
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tural and behavior differences in relatively mature organisms. These 
distinctions and classifications become less sharp as the intermediate 
areas which bridge the gap between isolated fields of research are 
developed. Correlational studies of physical and psychological char- 
acteristics among children are an example of such a marginal area 
of investigation, since both individual differences and developmental 
differences within the individual contribute to these relationships. 

In the case of traits which show appreciable age changes, any 
correlations found among children should be considered apart from 
similar correlations obtained in adult groups. A relationship present 
in the growing organism may disappear when maturity is reached, 
since it may have resulted simply from developmental inffuences. It is 
obvious, of course, that a 10-ycar-old will excel a 5-ycar-oid in both 
arithmetic and height. Thus if 10- and 5-year-olds arc included within 
the same group, an artificial correlatitm will he obtained between 
arithmetic and height. Such a “spurious'’ correlation is usually elimi- 
nated through the use of relative measures (e.g., 10) or through 
comparisons within a single age group. But these procedures do not 
rule out the entire contribution of developmental differences, since 
children of the same chronological age may vary widely in the degree 
of physical development whicii they have attained. 

'Micre is no consistent relationship betw^ecn developmental rate 
and adult status in physical characteristics. Tlie data on age ot onset 
of puberty furni^h a good illustration of this point. Individuals who 
reach sexual maturity earlier arc generally accelerated in physical 
development fiom early childhood (89). The age of emset of puberty 
is thus one manifestation of the individuars general rate of physical 
development. Ouring childhood, the e. dier-maturing individuals will 
be taller, heavier, and farfficr advanced in most physical character- 
istics than those who rcacli puberty later. But in adulthood, those 
who reached puberty earlier are not taller or heavier. In fact, a slight 
tendency has been found foi earlier-maturing gjrls to be somew^hat 
shorter during the late teens (89. 9b). The tailcst child in a group 
will not nccessarilv be the tallest two.'v years later. The physical 
status of a child depends in part upon certain absolute factors which 
make some individuals, for example, taller tiian others throughout 
life, and in part upon individual chffeiences in developmental rate. 

In the light of these considerations, it is perhaps not surprising 
to find that correlations between anatomical or physiological charac- 
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teristics and intelligence tend to run higher among children than 
among adults. These correlations arc still quite low, rarely exceeding 
.30, but they are often high enough to indicate a statistically signifi- 
cant relationship.-*' In an extensive study by Abernethy (1), for 
example, positive correlations were found between various anatomical 
measures and intelligence at all ages from 8 to 17, but the correla- 
tions tended to be lower in the groups which were approaching 
maturity, in comparable adult group included in this study, the 
correlations were virtually zero. With young ciiildrcn, ranging in age 
from 3 months to 8 years, Bayley (5) found average correlations of 
.16 and .15 between the height and intelligence of boys and girls, 
respectively. There is also some evidence that “skeletal age,’' as de- 
termined by X-ray photographs of bone structure, is significantly 
correlated with intelligence in children, and that the correlation 
diminishes with age (18). 

Some startling!) high correlations have been reported by Hinton 
(33) between basal metabolic rale and 10 in a group of 200 children 
ranging in age from 6 to 15. These correlations were close to .80 for 
the 6- to 9-year-old groups; from age 10 on they dropped faiijy con- 
sistently, reaching a value of about .50 among the 15-ycar-olds. It 
will be recalled that investigations on adolescents and adults showed 
virtually zero correlations between BMR and intelligence. If Hinton’s 
results arc confirmed by other studies, they may provide an interest- 
ing illustration of age changes in the relationship between bodily 
conditions and behavior. It may be noted that the BMR tends to be 
higher during periods of rapid growth. If BMR is shown to be signifi- 
cantly related to intellectual level in childhood, this may help to 
explain many of the other correlations. 

It has been argued that, in both their physical and psychological 
development, some individuals nitiy progress at a more rapid rate 
than others throughout their period of growth. According to this 
hypothesis, it is these differences in cle\eh)pmrntal rate which may 
account for the slight positive correlations found between intelligence 
and certain bodily characteristics among children. It should be re- 
membered, however, that growth does not occur at a uniform or 
regular rate within the individual, but exhibits many irregular spurts 
and lags. These temporary fluctuations in rate of growth arc quite 
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specific, and no parallelism has been discovered between psycho- 
logical and physical fluctuations within individual growth curves. 
Thus the monthly or annual increments in structural and in intel- 
lectual status are generally uncorrelaled (1, IR). This suggests that 
whatever relationship exists between bodily and bchavorial develop- 
ment is probably an indirect one. For example, the child who is 
physically accelerated is likely to learn to walk — and possibly talk — 
earlier, thereby expanding his environmental contacts in advance of 
the slower-maturing individual. This could account for a slight dif- 
ference in intellectual development in favor of the earlier-maturing 
child. On the other hand, the temporary ups and downs in physical 
and psychological development seem to result from a multitude of 
unrelated factors, and offer no support to the theory of a ‘'common 
underlying growth tendency.” 

The eflecl of puberty upon behavior development has itself been 
widely discussed. Contrary to popular belief, there is no evidence 
that intellectual development is cither consistently accelerated or hin- 
dered by the onset of sexual maturity (1, 18). Nor is there any 
relationship betw^een age of sexual maturity and cither intellectual 
or personality characteristics in adultliood, when racial and cultural 
difTerences are held constant (1, 96). The onset of puberty docs, in 
general, usher in changes in attitudes, interests, and emotional re- 
actions. In one survey (97), significant difTerences were found be- 
tween the personality test responses of pre-pubcrtal and post-pubertal 
girls of the same chronologic I age and comparable socio-economic 
and cultural status. The important role of social factors in bringing 
about these personality changes cannot however, be overlooked. 

SHNSORY HANDICAPS 

Sensory limitations have a much more direct bearing upon behavior 
than most other kinds of physical deficiency, since they cut olT 
environmental stimulation. I'he individual so afflicted is ps^^cho- 
iogically “isolated" from cultural contaas in the same sense as the 
wolf children of Midnaporc or Kaspar Hauser (cf. Ch. 6). We should 
therefore expect a fairly pronounced behavioral deficiency to be 
associated with sensory defects. For man, visual and auditory defects 
arc obviously the most serious sensory handicaps. Since human cul- 
ture is built to such a large extent upon a foundation of language — a 
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language acquired principally through the eye and the ear — defi- 
ciencies in these sensory areas arc of basic significance. 

Visual Handicaps. Any over-all estimate of the average intel- 
lectual status of the visually handicapped is extremely difficult and 
probably meaningless (cf. 29). One reason is that intelligence tests 
devised for the blind are not usually comparable to those for sighted 
children because of the omission and substitution of tests, changes in 
administration, and other alterations necessitated by the visual handi- 
cap. I’he most nearly comparable test is Hayes’ recent adaptation 
cf Forms L and M of the 1937 Slanford-Binei for blind children 
(31). For blind adolescents and adults, the Wechslcr-Bellevue can 
be used with only minor modifications (30). These tests have not 
been in use very long, however, and most of the large-scale com- 
parisons between blind and sighted are based on earlier, less com- 
parable tests. 

Another point to consider in such comparisons is the cause of 
blindness. In some cases, blindness results from pathological condi- 
tions which also lead to neurological deterioration, if the IQ's of 
such individuals are included in the total estimate, ihey sinqfly con- 
fuse the picture. The intellectual achievement of blind children also 
depends upon the amount and nature of special education which they 
have received. Such training tend^ to compensate for the visual handi- 
cap b> providing the necessviry contacts with the social environment 
through other sensory channels. With the marked progress in methods 
of instruction lor the blind, it is likely that the average iG of children 
in blind schools today is higher than it was twenty years ago — and that 
twenty years hence it will be still higher. 

I he aye of onset of blindness is likewise related to the amount of 
intellectual handicap, although the relationship is not a simple one. 
On the one hand, the later the loss of vision occurs, the more oppor- 
tunity the individual will have had for normal educational experiences. 
On the other hand, such an individual will have had less time to adjust 
to the blindness, and may encounter more interference m the acqui- 
sition of the new reaction systems required by the loss of vision. These 
two opposing influences probably account for the lack of correspond- 
ence generally found between age of onset of blindness and intelli- 
gence test performance or educational achievement (29). Another 
factor contributing to the intellectual development of the blind child is 
his emotional response to the handicap. The attitudes of his family and 
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associates, the general nature of the home milieu, and many other 
attendant circumstances will determine how effectively the individual 
adjusts to the handicap and will indirectly affect his educational and 
intellectual progress (cf. 91). 

A final and very important consideration in connection with the 
general intellectual level of the visually handicapped is the extent of 
the defect. Like all psychological characteristics, vi.don tends to follow 
a normal distribution in the general population. Between the large 
'‘normal” group and the totally blind, one finds innumerable degrees 
of handicap along a virtually continuous .scale. Sharply distinguished 
categories arc just as out of place here as in other aspects of individual 
differences. For practical convenience, a threefold classification is now 
generally employed, including correctable defects, partially seeing, 
and blind. The per cent of children falling into each of these eale- 
gorics, as estimated by the White House C’onicrent.e on Child Health 
and Protection (lUl), is as follows: 

Con cclablc dcIccts 

Pan Killy seeing U li) 

Blind ^ 0 05 

Correctable visual defects', nctiuilly coircctcd by llie use of 

glasses, have no cflect upon intellectual development. If the child 
wears glasses from the lime when the defect becomes appreciable, 
no interference with normal environmental contact results. When the 
defect is not compensated by means of lenses, liowevcr, the child’s 
school W'ork, and indirectly hi ‘nteilectiial development, usually sufTer. 
Inattention, lack of interest in school, loss o1 selhconlidcnce, and 
inlcrior performance may result from i nsuspcclcd visual deficiencies. 

'Ihc icrni “pen/h/Z/v seeing' it; app'icd to children whf^se visual 
deficiency is so serious as to necessitate special instructional tech- 
niques in sight-saving classes, w liere classroom procedures arc adapted 
to a limited use of vision. Survexs- (3, 10 1) in such sight-saving 
classes have shown an average JQ of about 00. The distribution is 
quite skewed, with a marked piling up nf eases at the lower IQ liwels. 
About 50% of the children have lO's beiow 90; and of these, from 
6% to 10% arc below' 70. Less than on the other hand, have 

Roughly, the bmils :irc between 20 "70 and 20 200 vis.on, ailhoiigh olhei factors 
arc also taken into act omit. 

""in one ot these surveys v'a), an adaplalicn of the Stanford Binet \^as em- 
ployed ill wh'ch ceitair. tests wcie reproduced m magnihed Itirm and with heavier 
and darker lines; in the other (101), the Lcs* is not specjlicd. 
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IQ’s of 1 1 0 or higher. At the time of their admission to sight-saving 
classes, many of these children show personality disorders associated 
with their visual handicap (3). Introversion, daydreaming, feelings 
of inferiority, and tension have been most frequently reported. 

Ihe blind have been defined as those who cannot be educated 
through visual means. Within this group, the individuals with a little 
vision arc sometimes the most retarded (29). Because they are less 
highly motivated to learn to depend upon touch, such individuals tend 
to dissipate their efforts. Certain selective factors operating in admis- 
sion to schools for the blind also tend to produce a difference in the 
same direction. Thus a bright child with a marginal amount of vision 
is more likely to succeed in a sight-saving class, while a dull child 
with the same amount of vision may fail and be sent to a school for 
the blind. Several surveys conducted in schools for the blind have 
shown an average retardation of from two to three years in school 
progress, but only a slight retardation in intelligence test perform- 
ance. As nearly as can be estimated, the average IQ of blind children 
is slightly above 90, and the per cent of IQ s in the subnoimal levels 
is about twice as large as that in the superior levels (29, 71 ).On the 
whole, the intellectual handicap seems to be no w'orse than that of 
the partially sighted, and there is some evidence that it is slightly less. 

In such tasks as learning a rpaze, tlie blind do somewhat better 
than blindfolded normal subjects (29, 71), probably because of tlic 
greater familiarity of the blind with the use of non-visual cues. There 
is no evidence, however, for the popular belief that the blind liave a 
finer discrimination than the sighted in other senses, such as hearing 
or touch (SO). The remarkable feats often accomplished by blind 
persons through the use of other senses stem from a more clficient 
use of sensory cues rather th.an from a superiority of the senses them- 
selves. Through prolonged training, an individual may acquire the 
ability to respond to very slight cues whicli are ordinarily ignored. 
Such seems to be the case among the blind. The so-called obstacle 
sense of the blind, which enables them to perceive obstacles in their 
path, has been showm to be based primarily upon learned responses 
to auditory cues (102). 

In personality development, the adjustment made to the visual 
handicap varies widely with the individual. The range of personality 
characteristics is fully as wide among the blind as among the sighted. 
There is some evidence (10, 61) that the number of neurotic symp- 



Bodily Conditions and Behavior 


409 


toms among blind adolescents is significantly greater than among com- 
parable sighted groups, but the difference is smaller than might be 
expected in view of their handicap. Emotional maladjustment tends 
to be less among the totally blind than among the partially sighted 
(61 ) . Moreover, it seems fairly clear that it is not the defect itself, but 
the social treatment, which is at the basis of the insecurity and other 
emotional difficulties of the blind (52, 53). 

Auditory Handicaps, Contrary to popular belief, hearing defi- 
ciencies constitute a more serious handicap to intellectual development 
than do visual defects. Deafness interferes more than blindness with 
language development and with normal social contacts. In estimating 
the intellectual handicap occasioned by hearing deficiencies, the same 
difficulties are encountered as in the testing of the visually handi- 
capped. A working classification of auditory deficiencies has been 
devised which closely parallels the threefold classification of visual 
handicaps discussed above. It has been estimated that about 14% of 
school children have defective hearinij, a category referring to milder 
hearing disabilities ( 3 ) . Such handicaps more often escape notice than 
the visual, and the child's behavior is mistaken for carelessness, indif- 
ference, rudeness, or dullness. Among the effects of such handicaps 
upon the child are poor scholarship, speech defects, loss of interest, 
social ak^ofness, and suspicion (3). 

Those classified as hard-of-hearin<^ have a more conspicuous de- 
fect, but are nevertheless able to use hearing in acquiring an under- 
standing of spoken language.^ ’ Tn general intelligence, language devel- 
opment, and educational progress, they arc intermediate between those 
with minor hearing deficiencies and tlv' deaf (3, 71, 72, 95). There 
is some evidence to suggest that on such tests as the Pintner Non- 
Language Scale, hard-of-hearing school children are not inferior to 
their normal-hearing classmates (20). When hard-of-hearing children 
are matched wfith normal-hearing classmates in non-language intelli- 
gence test score, however, the hard-of-hearing do more poorly than 
the normal-hearing in tests of scholastic' achievement (20). 

Among the deaf arc included those ii.dividuals whose hearing de- 
ficiency is so serious as to prevent the acquisition of language in the 
ordinary environment. Formerly known as “deaf-mutes,” such indi- 
viduals provide a vivid demonstration of the influence of environ- 

Either because the handicap is less severe than that of the deaf or because the 
loss of hearing occurred after the acquisition of language. 
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mental stimulation upon the development of an important behavior 
function. Never having heard the human voice, the “deaf-mute” is 
unable to speak, although his vocal organs are perfectly normal. 1'he 
presence of human vocal organs does not in itself lead to the develop- 
ment of human speech, any more than any other structure insures the 
appearance of the function ordinarily associated with it. Vocal organs 
of a certain type arc a necessary but not a sufficient condition for the 
acquisition of speech. That the deficiency of the “deaf-mute” is a 
stimulational one is shown by the fact that, with modern teaching 
methods based upon the use of other sensory cues, such individuals 
can be taught to speak normally. The remarkable results achieved 
with certain persons who were both blind and deaf point up still fur- 
ther the importance of training in behavioral development. The most 
famous examples arc Helen Keller and Laura Bridgman, who 
achieved considerable eminence in their busy careers despite this dual 
handicap. 

A number of extensive test surveys have been conducted in schools 
for the deaf (cf. 7l ). Educationally, such groups arc as much as four 
or five years retarded. On the usual verbal-type intelligence tCst, the 
deaf experience considerable difficulty because of their deficient mas- 
tery of language and linguistic concepts. So great is this handicap, that 
verbal tests arc generally considered inapplicable to deaf children, 
even though such tests may involve no spoken language. The problem 
of testing the deaf was, in fact, one of the principal reasons which led 
to the construction of the early non-language and performance scales. 
In one of the most comprehensive surveys of deaf children (cf. 71, 
p. 118), the Pintner Non-Language Test was given to 4432 children, 
1 2 years of age or older, in 41 schools for the deaf. The average MA 
and IQ of the children in each year group from 12 to 15 were as 
follows; 

A V erage MA A verage I Q 

10 - 9 86 

11 - 2 84 

1 1 - 8 83 

12 - 1 82 

On performance tests, the mean TQ of deaf children ranges from 
slightly below 90 to slightly above 100, depending upon the nature of 


C hrtmological 
Age Level 

12 

13 

14 

15 
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the tests.-'^ It is likely that this difference is largely a matter of the 
degree to which language concepts aid in the performance of the test. 
Since language serves an important function in so much of our think- 
ing, the linguistic retardation occasioned by deafness handicaps the 
individual in a fairly broad area of intellectual activity. No relation- 
ship has been found between the age of onset of deafness and scores 
on non-language or performance tests. Educational achievement, on 
the other hand, is clearly better when the loss of hearing occurs at 
about 4 years or older, after the normal acquisition of language (71). 

In personality development, the deaf and hard-of-hearing show 
many of the same reactions noted among the visually handicapped. 
They tend as a group to be somewhat more emotionally unstable, in- 
troverted, shy, and insecure than normal-hearing persons f 14, 56, 
69, 71 ). Deaf children also lend to be below the norms in social matu- 
rity, are less likely to be leadeis, and present more behavior problems 
than other children (63, 71 ). In general, the more severe the hearing 
handicap, the greater the personality maladjusiment (71), a relation- 
ship which did not hold in the case of the visually handicapped. It is 
interesting to note that deaf children who come from homes in which 
there are deaf adults tend to be better adjusted than those reared in 
homes in which all the adults have normal hearing (70). This sug- 
gests the dependence of the deaf child's emotional adjustment upon 
proper adult understanding of the child's problems during his forma- 
tive years. 


(.I'NI RAL I-A ALI'ATION 


Our fundamental question in this cha[ er has been: how arc individ- 
ual ditTcrenees in behavioi related to individual dillercnce^ in bodily 
conditions? Let us see what sort of an answer the data have provided. 
First, we must recognize that certain pathological conditions of the 
organism have characteristic physical as well as behavioral symptoms. 
Rut we cannot generalize Irom the association fecund in these abnor- 
mal cases to a p(>>sible connection wiie^ : the normal range of varia- 
tion. do take an obvious and extreme illuslration, a peison whose legs 
have been anipiitadecl at the knee is usually unable to dance. It docs 
not follow, however, that length of leg is correlated with ability to 


i \ 46 , 5 ?, 63 , 71 , 93 , iu}. 
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dance and that those persons with longer legs will make the better 
dancers. 

Aside from the relationships which have been demonstrated in 
pathological conditions, surprisingly little is known directly — although 
much has been inferred — regarding the operation of physiological 
factors in behavioral development. In the held of endocrinology, for 
example, much remains to be learned. Too often what has been offered 
as a stimulating hypothesis for further research has been interpreted 
by the layman as an established fact. The same may be said in regard 
to nerve physiology. The held abounds in speculation and the experts 
still disagree. At such a stage, it is dehnitely premature to venture a 
systematic analysis of behavior differences in terms of the nervous 
system. 

Turning from the observation of pathological cases and speculations 
on the physiological mechanisms underlying behavior to data collected 
on normal groups, we still meet difficulties. Many of the investigations 
on this problem have been inadequately controlled. Through the mis- 
interpretation of statistical techniques, slight general trends in groups 
have been erroneously attributed to individual cases. It willj^e re- 
called, for example, that the small differences in group averages, 
which were regarded as significant by many early workers, actually 
showed only a negligible relationj^ip when the individual scores were 
correlated. The pronounced overlapping of groups was often ignored. 
Age differences were occasionally present within the groups, thus pro- 
ducing a spurious connection between certain physical characteristics 
and intellectual level. In many investigations showing a relationship 
between physical condition and intelligence, differences in socio-eco- 
nomic level may account for whatever positive correlation has been 
found. The individual who comes from a better home will have richer 
opportunities for intellectual development and at the same time will 
receive better physical care. He will be brought up under more sani- 
tary conditions and will have less chance of contracting disease than 
the less fortunate child reared in a city slum or a poor rural district. 

All in all, the available data furnish little evidence for a bona fide 
connection between behavior characteristics and physical conditions 
among normal persons. Many fields of research within this area, how- 
ever, have barely been touched, and more information is obviously 
needed for a definitive answer. The explanations which have been 
proposed for such relationships fall into four major types. The theory 
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which prompted the investigations of Galton and other early workers 
is that of *'constitutionaV* inferiority or superiority. In contrast to the 
popular notion of compensation, this theory maintains that “good 
things go together” and that the person who is superior in one respect 
tends to be superior in others. Any positive correlation between psy- 
chological and physical traits is attributed to a common “quality of 
the organism” which underlies all forms of development. Recent 
studies lend practically no support to this interpretation. 

A second possible typ>c of relationship may be described as direct 
behavioral handicap. This is best illustrated by pathological extremes, 
such as cerebral anoxia, the extremely small brain of the microceph- 
alic idiot, or the underactive thyroid of the cretin. In such cases, the 
“minimum structural prerequisites” for normal behavior development 
are absent. It is bccomi.ug increasingly apparent, however, that among 
the large majority of individuals the diiect coiitrol of behavior by 
structural factors is not very rigid. Beyon*3 a certain essential mini- 
mum, further dilTercnccs in structural characteristics arc not neces- 
sarily accompanied by corresponding diMerences in behavior. To put 
it dilTcrcntly, the structural equipment of most individuals permits a 
very wide latitude in behavior development. 

Another explanation is based upon the indirect behavioral handicap 
resulting from physical deficiencies. This handicap can take many 
forms. In the case of sensory dcricicncies, there is a partial stimula- 
tional isolation of the individual. Malnutrition, poor health, and other 
general physiological conditiv is reduce endurance, increase fatigua- 
bilily, affect muscular development, and generally lower the eliiciency 
of work. These conditions, if sulTicieiV’v prolonged, may be expected 
to retard intellectual dcvelnpmcnt to a :ertain cMenl. Physi'ial defects 
or discomforts also serve as a poweiful distraction and thus make it 
more ditficult for the child to concentrate on his school work or other 
tasks. Finally, certain striking facial, cranial, or bodily characteristics 
which have acquired a specific signlhcance through social stereotypes 
may affect the individuars subsequent intellectual and emotional de- 
velopment, because of the attitudes which they engender. The social 
consequences of poor health (>r of sensory handicaps exert a similar 
inlluencc upon behavior. 

A fourth aiid final type of relationship, which has come to the fore 
in recent years, is that implied by the term psychosomatic ” In this 
case, the psychological condition is logically regarded as the ante- 
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cedent, and the physiological disorder as its consequent. One of the 
clearest examples is to be found in the digestive, circulatory, and other 
internal changes occurring during emotional excitement. It is reason- 
able to suppose that continued stimulation of such physiological reac- 
tions may lead to a more lasting disruption of function, as illustrated 
by the development of gastric ulcers. 

In conclusion, the evaluation of any investigation purporting to 
show a relationship between physical and psychological characteristics 
involves two questions. First, is the relationship genuine, or does it 
result from socio-economic or other uncontrolled conditions? Sec- 
ondly, when a genuine relationship has been demonstrated, what is its 
specific nature, and how can the relationship be explained? 



CHAPTER 

13 


The Quest for 
Constitutional Types 


The relationship between physical and psychological traits has also 
been considered from tlu point of view of coustiiiitional types. In the 
cITort to simplify the almost infinite observable variations among indi- 
viduals, certain basic human types have been proposed. A specific 
individual can then be described as a more or less close approxima- 
tion to one of a small number of types. Such constitutional types arc 
offered as a characterization of the individual as a whole, in all his 
physical, intellectual, and emotional traits, and are not to be envisaged 
in terms of any isolated qualities of the organism. There is also a 
strong presumption of an innate or hereditary basis for the develop- 
ment of types. Thus a theory of constitutional types implies a certain 
degree of conformity among the various characteristics of the individ- 
ual, these characteristics bein^, ultimately attributed to an underlying 
innate tendency^ 

Type theories have been eagerly reci ived by the general public as 
a short-cut to the understanding of huiran nature. I'he layman is im- 
patient with the slow, meticulous methods of science. This is particu- 
larly true in psychology, because of the more intimate and immediate 
bearing which this science has upon man’s everyday life. The termi- 
nology of type theories has become such an iiijegral part of our lan- 
guage that it is almost impossible for us speak about people without 
inadvertently lapsing into hypothetical categories. The popular tend- 

’ The concept of tvpes has also been employed in the dcsciiption of specific 
functions, as in Galton’s classification of individuals in regard to their predominant 
field of imagery, i.e., visual, auditory, olfactory, etc (12). Such types, however, do 
not characterize the pcisonahty as a whole and are not to be confused with the 
constitutional types under conskieralioti. 
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ency to make sharp distinctions, coupled with the hope that character 
and mentality can be “read’' from physical signs, has helped to keep 
“types” alive. Among psychologists, there have been recurrent revivals 
of interest in typology. As new theories appear, they arc followed by 
a iiurry of hopeful research. In the sections which follow, we shall 
consider some of the best-known type theories, inquire into their 
psychological implications, and examine some representative data col- 
lected to support or lest their claims. 


TYPE TUI ORll S 'IIIROUOIl THE AOl S 

The first clearly formulated attempt to classify individuals into oasic 
types was probably that ol the Cireek physician Hippocrates in the 
fifth century u.c. Hippocrates proposed a twofold division into Jiuhitiis 
apopleciiciis and liahiius' p/uliisk as. The former corresponds to a 
thick-set, heavy body build, susceptible to apoplexy and similar ph>si- 
cal disorders; the latter is characterized 1)> a long, slender bod> and 
susceptibility to respiratory di^eases such as tuberculosis, of 

the predominantly medical interest of its exponent, this chu^ifieation 
was based primarilv upon relative susceptibility to dillerent kinds of 
physical ailments. Such an approach has. however, peisislcd to tiic 
present, many current type theoiies taking susceptibility to various 
ph\sical 01 mental disi)rders as their starting point. 

d’he sec(md-cciHur\ (ireek physician (ialen, tiec]uentl> called the 
father ol modern medicine, is lesponsible for the well-known classifi- 
cation of “temperaments” into the sanguine, the cludeiic, tlie phleg- 
matic, and the melancholic. These terms have achieved great pc»pu- 
larity as descriptive figures of speech, and one wondcis how often 
they arc still being taken lltcrall). The theories of both Hij'ipoerates 
and Cialen were founded upon a biochemical approach to personality. 
1 luis Hippocrates attributed the development of his two types to the 
relative proportion c»f “fire” and “water” elements in the individual’s 
make-up. Cialen ascribed his four temperaments to the excess of one 
or another of four “humors” or body fluids. 

In more modem times, many variations of tvpe concepts have ap- 
peared in literature, ait, philosophy, medicine, anthropology, and any 
other field in whieli man is the cential figure. Hvcr> school child is 
familiar with the quotation from Act I of Shakespeare’s Julius CUiescir: 
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Let me have men about me that are fat; 

Sleck-hcaded men and such as sleep o’ nights. 

Yond Cassius has a lean and hungry look; 

He thinks too much: such men arc dangerous. 

The nineteenth and early ivvcnticlh centuries have been described 
as the golden age of typologies (38). The English anthropologist 
Walker, “ in 1852, wrote of "nutritive beauty,” "locomotive beauty,” 
and "mental beauty.” Jn the following year, Carus,“ a German zoolo- 
gist, described three bodily types: the phlegmatic, in which the region 
of the digestive organs is prominent; the athletic, with strongly devel- 
oped bones and muscles; and the asthenic, with narrow chest, a long 
body, and poorly developed skeleton and musculature. In France, 
several type theories were proposed, chief among which was that 
formulated by F^ostan (35 ) in 1828 and later adopted by Sigaud :ind 
his students. This classification recognized four types: digestive, mus- 
cular, respiratory, and cerebral. Manouvrier suggested a division into 
makroskele and brachyskcle, or narrow skeleton and broad skeleton. 
MacAulilTe offered the type plat (flat) and the type rond (round). 

In Italy, Viola (cf. 30) formulated a theoiy which became familiar 
lo psychologists through the researches of Naccarati (30, 31) and 
others in America. Viola's types include the macrosplanchnic, the 
normosplanchnic, and the microsplanchnic. The macrosplanchnic pos- 
sesses a large trunk which is overdeveloped in comparison with the 
length of the limbs; the horizontal dimensions are relatively large, the 
vertical relatively small. The microsplanchnic, on the other hand, has 
a small trunk and long limbs, the vertical dimensions being relatively 
in excess of the horizontal. Between these two extremes is the normo- 
splanchnic, which exhibits a proportio/:atc and harmonioir- develop- 
ment of trunk and limbs. Viola suggested a series of body measure- 
ments to be employed in classifying individuals into these types. 
Naccarati (30) later substituted a single numerical expression of body 
build, the morphologic index, computed as follows: 

length of one arm -j- length of one leg 
volume of trunk 

The trunk volume in this formula is determined by a scries of rather 
elaborate measurements. 


‘ Cf. Wertheimer and Hesketh (46), 
'^Cf. Wertheimer and Hesketh (46). 



424 Differential Psychology 


According to Viola’s theory, the macrosplanchnic represents an 
overdevelopment of the nutritional or “vegetative system” contained 
within the trunk. The microsplanchnic, on the other hand, is charac- 
terized by an overdevelopment of the “animal system,” consisting of 
the musculature, nervous system, and skeleton. The two types were 
believed to differ in intellectual and emotional characteristics as a 
result of the relative activity of the vegetative and animal systems, 
which were regarded as independent and even antagonistic in their 
action. In his elaborcition of Viola's theory, Naccarati (30) suggested 
that the microsplanchnic corresponds to a hyperthyroid condition and 
should therefore be expected to manifest the various characteristics 
associated with overactivity of this gland. 

Pende (cf. 46) subsequently proposed a distinction between hyper- 
vegetative and hypovegetative biotypes, a classification which, as the 
terms imply, has much in common with Viola’s theory. A definite 
endocrine basis was offered for this distinction. 

In America, Davenport (10) classified individuals into the fleshy, 
the medium, and the slender biotypes. Stockard (42) distinguished 
between the linear and the lateral types, which he related to the 
activity of the thyroid. The linear type was described iis active, ener- 
getic, and nervous, but emotionally controlled; such individuals grow 
rapidly and reach puberty at a relatively early age. The lateral type 
is less active and grows at a slower rate. The linear type is also char- 
acterized by a dolichocephalic skull, the lateral by a brachyccphalic 
one. Mention should also be made, from the psychological side, of the 
famous distinction proposed by William James (18) between “tender- 
minded” and “tough-minded” persons, a distinction which bears a 
certain resemblance to the introvert-extrovert classification to be dis- 
cussed shortly. 

Pavlov (32), the Russian physiologist of conditioned reaction fame, 
suggested a type classification in terms of the nervous system. On the 
basis of observations made in the course of his conditioning experi- 
ments on dogs, he proposed two predominant, opposed types, corre- 
sponding to extreme tendencies toward excitation or inhibition, re- 
spectively. Intermediate, less pronounced types were also described. 
Pavlov called attention to the resemblance between the classification 
so obtained and the classical division into sanguine, melancholic, 
phlegmatic, and choleric temperaments. He suggested that, “Until a 
rigid scientific classification is fully established for all the various 
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types of central nervous system ... we may be permitted to make 
use of the ancient classification of the so-called temperaments” (32, 

p. 286). 

Within the present century, type psychology has flourished most 
vigorously in Germany. Several variations and ramifications of type 
theory have been formulated by contemporary German psychologists. 
Jacnsch (cf. 22, 24) proposed a classification of constitutional types 
on the basis of eidetic imagery. The eidetic image is a peculiarly vivid 
and detailed memory image * which is experienced by some individ- 
uals. Eidetic imagery has been found to be most common in late 
childhood and to disappear gradually during adolescence, although it 
has also been discovered among some adults. The eidetic image may 
be a photographic replica of the original object, or it may differ from 
the latter in certain characteristic ways. Jaenseb recognized two types 
of eidetic individuals. In the first type, the image can be called up, ban- 
ished, and altered voluntarily. The eidetic image in such cases may 
be nothing more than a “visualized idea’' and it is accepted as natural 
and normal by the individual. In tlie second type, the image usually 
arises spontaneously and may persevere in spite of efforts to banish 
it; voluntary alterations in the qualities of the image are often impos- 
vsible. Such images do not come up very frequently, and are often 
regarded as unpleasant and even uncanny by the subject. 

Jaensch considered these two eidetic types to be distinct constitu- 
tional types, differing in many bodily and psychological traits and 
characterized by basically dissimilar “psychophysical reaction sys- 
tems/’ The cidctic characteristics were simply taken as convenient 
starting points in the classification. The first of the two types described 
above was designated the B-type, because of its alleged resemblance to 
the Basedow syndrome."’ and the second, the T-type, owing to the simi- 
larity of some of its manifestations to the condition of tetany.^ 

Jung’s introvert and c.xtrovert types arc well known (19). Jung 
maintained that in the extrovert the “psychic energy’’ is turned out- 
ward to the objective environment: in the introvert, it is turned inward 

Eiactic ima^^cs have usually been inveslisalcd in the visual field, although il has 
been claimed lhal they arc equally common i • ‘-thet senses. 

'‘A coiulilion characlcn/cd b> prominence ol the eyeballs, cnlaigemcnt of the 
thyroid gland, muscular tremors, rapid heart action, and more oi less profound 
mental disturbance: believed to be cMiised by ovcraclivily of the thyroid gland. 

® A motor disorder, including muscular trcmoi, muscular spasms, and sometimes 
uncoordinated muscular conli actions iollowing upon an effort to make a voluntary 
movement; attributed to insuthcicnl secretion of the paiathyrold gland. 
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lo a subjective world. The extrovert is predominantly oriented in all 
his actions, thoughts, interests, and feelings by the objects and people 
about him. His beliefs and opinions arc guided by the mores of his 
group. I hc introvert, on the other hand, is governed by subjective 
factors; all his behavior has a subjective, inner reference. Jung re- 
gards these two types as fiindamental biological contrasts. They denote 
for him basic altitudes which characterize all aspects of the individ- 
ual’s psychological make-up. 

Jung's types have beccmic more widely kno^n, however, in terms of 
their emotional and social manifestations. Thus the introvert is usually 
thought of as an emotionally shut-in individual who shuns social con- 
tacts. prefers to work alone, and finds more pleasure in imaginative 
work than in a life of action. I'he extrovert suggests the “salesman” 
type, who meets people easily, is happiest in a social situation, and is 
friendly and interested in his fellow-beings. Jung regards introversion 
and cxtro\'ersion as characterizations of normal people. In extreme 
forms, to be sure, they would predispose the individual to mental 
disorders which arc opposite in their symptoms.'^ The fundamental 
distinction, however, is not made on the basis of these mental dis- 
orders. The susceptihility to one or the other form of insfmity is 
considered simply another manifestation of the basic type. 

Mention may also be made of Spranger’s (cf. 41) description of 
six fundamental types of indivicltiality, including the theoretical, eco- 
nomic, aesthetic, social, political, and religious. These “types'' are 
regarded as mcaning-teiidencies or values in terms of which an indi- 
viduaTs responses to his environment are to he understood. They are 
ideal types or schemata of understanding, rather than empirically 
observable types. 

Kretschmer's type theory (25) has undoubtedly been one of the 
most infiuential in stimulating research. Physically, Kretschmer 
classifies individuals into four groups, the pyknic, athletic, Icptosome, 
and dysplastic. I'he pyknic type of b(xJy build is short and thick-set 
with relatively large trunk and short legs, round chest, rounded shoul- 
ders, and short hands and feet. Fhc ai/iletir has a more proportionate 
development of trunk and limbs, well-developed bones and muscles, 
wide shoulders, and large hands and feet. The lepiosonie is generally 

'’’ This clistinctjon was emphasiVed by McOougall, who wrote; “. , . pcisons of 
the extrovert tcnipcranienl seem rnotc lialdc, under siiinn, to disorder of the hyslciic 
or dissociative type, those of introvert, or shul-in, temperament to disorder ot tlie 
neurasthenic type” ( 28 , p. 28 ). 
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characterized by small body volume in relation to height. He is tall 
and slender, with relatively narrow chest, long legs, elongated face, 
and long, narrow hands and feet. In the dysplastic category arc placed 
all individuals who present an incompatible mixture of type character- 
istics in their physical development. Kretschmer suggested a wide 
variety of physical measures, to be used m conjunction with the clin- 
ical diagnosis of the experimenter, for differentiating between these 
bodily types. 

The basic contention of Kretschmer’s theory is that a relationship 
exists between the body types which he describes and two essentially 
opposed '‘temperaments," the (\\loul and the sdiizoid. 7'he cycloid 
individual manifests personality traits which in extreme cases would 
be classified under the cyclical, or manic-depressive, form of insanity. 
I’he schizoid tends toward schizophrenia, which is characterized by 
extreme introversion and lack of interest in one's surroundings. 
Kretschmer claims that the cycloid is usually pyknic, w'hereas the 
scliizoid is Icptosome or, less frecjucnlly, athletic. Although originally 
applied to dillcrent I'orms of mental disorders, this lhei>ry was subse- 
quently extended to include normal individuals who manifest no per- 
somility distuibancc. The terms "cyclothymc” and “schizothyme” were 
devised to denote these two normal biotypes, T he former is described 
as social, friendly, lively, practical, and realistic; the latter as quiet 
and reserved, more solitary, timid, and shut-in. It will be noted that 
these descriptions correspond quite closely to Jung’s extrovert and 
introvert t v pcs. 

The latest revival of interest in typology followed the proposal of 
a somewhat different approach by Sheldon and his collaborators (39, 
40) in this cunmiry during the early 19-h)’s. This is not a type theory 
in one sense of the term, since individuals are regarded as falling 
along a continuous distribution in both bodily and psychological 
characteristics. What Sheldon argued for was a fundamental and prob- 
ably innate relationship between body build ard personality, neither 
of which need fall into distinct categories. We might say that this 
theory has retained tlic “constilulionai concept but dropped the 
“type" concept of the traditional typologies. A fuller discussion of 
Sheldon’s theory, together v\ith an analysis of the evidence for it, has 
been reserved for a later section ol the present chapter. 

Throughout the various type theories which have been described, 
we can delect a general dichotomy between two opposed constitutional 
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types. From the standpoint of physique, the distinction is one between 
the long narrow body, with relatively long limbs, and the short stocky 
build, with relatively large trunk and short limbs. In respect to person- 
ality, we are offered at the one extreme the expansive, sociable, easy- 
going, and practical man, and at the other the more taciturn, 
unsociable, intellectually independent, or idealistic type. Occasionally, 
more than two categories are given, but the additional types are 
usually found to be either intermediate degrees or modifications of the 
major ones. 

In some theories, the structural classification is emphasized; in 
others the behavioral one is foremost. Many of the theories draw upon 
pathological conditions either for striking examples or for their basic 
concepts. Thus we frequently find susceptibility to a given class of 
physical or mental disorders as an outstanding charactcrisiic of each 
type. In many cases, loo, the various physical and personality types 
have been linked with race, and attempts have been made to attribute 
racial differences to the predominance of one or another constitutional 
type within each race.^ 

THE LOGIC OF ( ONSl ITOTrONAL TYPFLS 

Multimodal Distribution, Type theories have been most commonly 
criticized because of their attempts to classify individuals into sharply 
divided categories. Such a procedure would imply a multimodal dis- 
tribution of traits. The intioverts, for example, would be expected to 
cluster at one end of the scale, the extroverts at the other end, and 
the point of demarcation between them would be clearly apparent. 
Actual measurement, however, reveals a unimodal distribution of all 
traits, which closely resembles the bell-shaped normal curve (cf, 
Ch. 3). Moreover, it i‘; often difficult to classify a given individual 
definitely into one type or the other. The typologists, when confronted 
with this difficulty, have frequently proposed intermediate or “mixed” 
types to bridge the gap between the extremes. Thus .Tung suggested an 
ambivert type which manifests neither introvert nor extrovert tend- 
encies to a predominant degree. Observation seems to show, however, 
that the ambivert category is the largest, and the decided introverts 
and extroverts arc relatively rare. The curve, too, has no clear breaks, 

® CL, e.g., Weidenrcich (45). For a further treutment of ibis application of 
typology, sec Chapter 20, 
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but only a continuous gradation from the mean to the two extremes. 
As was indicated in Chapter 3, this general type of distribution has 
been found in practically all measurable trails of the individual, 
whether social, emotional, intellectual, or physical. 

It is apparent, then, that in so far as type theories imply the classifi- 
cation of individuals into clear-cut classes in cither physique or per- 
sonality, they do not fit the facts. Such an assumption, however, is not 
necessarily inherent in all systems of human typology. It is more 
characteristic of the popular versions and adaptations ol‘ type theories 
than of the original concepts themselves. To be sure, type psycholo- 
gists have often attempted to categorize individuals, but this was not 
an indispensable part of their theories; their concepts have occasionally 
been sufiiciently modilied to permit a normal distribution of traits. 

It has been suggested, for example, that types may refer simply to 
original varieties, breeds, or “biotypes” of man (cf. 20). d hrough suc- 
cessive generations of interbreeding, it has been argued, mixed types 
have been produced which now outnumber the remaining specimens 
of pure types. It is well known that, through the mechanism of hered- 
ity, interbreeding will in the long run produce a larger number of 
mixed than pure individuals. The same could apply to interbreeding 
among the proposed human biotypes. 1'his situation would then pre- 
sent a normal distribution of traits, with the largest number of indi- 
viduals in the center of the distribution, corresponding to the numeri- 
cally largest “mixed” group. Thus the form of the distribution curve 
cannot in itself indicate the composition of the group. The normal 
curve might be obtained with a single intermediate type and minor 
deviations from it, or it might result from the mixture of several pure 
biotypes.*-^ 

Constitutional Relationships. The only essential implication in 
the concept of “biotypes” seems to be a certain organization among 
the various characteristics of the individual, d'hus a relationship would 
be expected between body build, emotional reactions, and intellectual 
traits. If there exist diverse biological types of man, each manifesting 
its own peculiarities in physique, pcrsoizdity, and intellect, we should 
find a certain degree of conformity among these characteristics of the 
individual. When so conceived, the problem of types is ultimately 
reducible to a consideration of the relationship between structural and 

•’For a more technical analysis of the lopic of dilTeient typologies, cf. Win- 
throp (47, 48, 47). 
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behavioral qualities. It is not, however, concerned with isolated traits, 
but with the composite picture of the individual as a whole. 

Methodolog[y of Constitutional Studies. How can such theories of 
constitutional typology be tested? One method involves the classifica- 
tion of individuals into extreme behavior groups and the subsequent 
comparison of these groups in regard to physique. This technique has 
been employed largely with abnormal cases in the ell'ort to check the 
assertions that a given physique predisposes the individual to a certain 
kind of mental disorder. Thus, for example, the relative number of 
pyknics and Icptosomes among individuals manifesting dilferent forms 
of insanity has been compared and evaluated in terms of the expected 
association. 

A second method is based upon the correlational analysis of meas- 
urements collected on unseicctcd normal groups. Various physical 
indices of bod\ build have been worked out for this purpose. Sucli 
indices are then correlated with test scores or ratings on crucial per- 
sonality traits. A high correlation would be evidence lor the con- 
formity implied by type theories. 

In a few studies, which illustrate a third approach, efforts have been 
made to identify and select ‘f)iire types" on the basis of physical cri- 
teria; the psyclufiogical characteristics of the selected individuals arc 
then thoroughly investigated. I’he subjects arc originally chosen so as 
to represent “good specimens” of each type. J'hese physically con- 
trasted groups are then compared in emotional and intellecliial reac- 
tions. This method is in a sense the opposite of the first method 
described above, which began with psychologically contra" ted grt>Lips 
and proceeded lo compare them in physique. 1 he ])resenl method 
starts with groups clearly differentiated in physic|ue and compares 
them in behavior. It should be noted, however, that the present method 
does not merely choose individuals who represent extremes in any one 
phvsical eharaeteristie, such as height or weight. It is an essential fea- 
ture of this method that individuals arc clioscii on the basis of a 
composite of physical specifications so as to fit the particular type 
pattern. 

A foiirtn and more recently developed approacli is to identify first 
tliL basic components of both plnsiquc and personality. These com- 
ponents constitute tiie categories in terms of which each individual is 
described or “typed” in both body build and behavior characteristics. 
The correlation between each individual's physical and psychological 



The Quest for Constitutional Types 


431 


status can then be found. This approach differs from the second one 
described above only in its emphasis upon what is to be correlated. 
The argument is that the correlations — or lack of correlations — here- 
tofore found between physique and personality may be misleading 
because inadequate, superficial, or unessential aspects of both phy- 
sique and personality were measured. The principal efforts of this 
approach arc thus concentrated on discovering the basic categories in 
terms of which both the domain of body build and the domain of 
behavior can be described. 

It should be noted that the differences among these four approaches 
are differences in emphasis rather than in long-range objectives. All 
are fundamentally concerned with the relationship between structural 
and behavioral characteristics. In the sections which follow, the inves- 
tigations have been grouped under these four approaches primarily 
for convenience of piescntation. Moreover, the order in which these 
four methods have been treated in the present section, as well as in 
the remainder of the chapter, is a chronological rather than a logical 
one. In terms of similarity iff' procedure, the first and third approaches 
might have been considered together, and the second and fourth 
ccHild have been similarly grouped. 

EVIDENCE FROM ABNORMAL CASES 

Kretschmer originally formulated his theory of constitutional types 
from observations on psychotv' padents. In comparing the body build 
of schizophrenics and manic-depressives, he consistently found a 
greater proportion ol leptosomes among the former and pyknics 
among the latter. In one survey, Kretscluncr (26) compiled data from 
several investigators on over 4()()() abnormal cases, with the results 
shown in Table 19. ll is apparent that by far the largest percentage 
of schizophrenics fall into the Icptosome and athletic categories, and 
an equally large percentage of manic-depressiver fall into the pyknic 
and mixed pyknic classes. 

Wertheimer and Hesketh (46) measu?-cd 65 male patients chosen 
at random from two American institutions for the insane. Of these, 
1 1 had been clearly diagnosed as manic-depressive and 23 as schizo- 
phrenic. The major part of the investigation was therefore confined to 
these cases. Such subjects were first classified into Kretschmer’s body 
types on the basis of general observation. A scries of 53 antliropo- 
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TABLE 19 Per Cent of Schizophrenics and Manic-Depressives Falling 
into Different Categories of Body Type 


(1 nmi Kriisi Inner, 26, n 34; 


Body 7 ype 

S( lu::ophr( me y 

MautL-Dcpiessix c\ 

Pyknic and iriixcd pvknic 

12 8 

66 7 

1 eptosome and athletic 

66 0 

2^ 6 

D>splaslic 

1 1 

04 

Llnclassifiable 

9.9 

9.3 


mclric measurements were tlien taken and various bodily indices com- 
puted. One of these indices was ultimately selected as the most 
satisfactory and adopted as the chief basis of classification. A close 
correspondence was found between the two procedures Those indi- 
viduals classified as pyknic by the experimenter's diagnosis invariably 
had indices under 255; those classified as leptosomes had indices over 
270. There was no overlapping in the indices of these two groups. By 
Cither method of classification, however, the number ol decided p>k- 
nics or leptosomes was small, most individuals tailing into the interme- 
diate athletic or mixed groups, as would be expected. 

TABI.E 20 lutthet Data on Pit Cent (>j Schizophrenics and Manic- 
Deptessivcs I ailing into Different Categinus e/ Bodv l\pc 
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Mann Depn 
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Pvknic 

43 
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Pyknoid 

no 

36 4 

Athletic 

26 1 

9.0 

I cptosomc-athklic-inixcd 

34.8 

0. 

I eptosome 

174 

0 

Unclear 

4.3 

9.0 


The percentages of persons of each body type found in the schizo- 
phrenic and manic-depressive groups are given in Table 20. These 
data again show a marked predominance of pyknic types among the 
manic-depressives. The schizophrenics scatter over a wide variety of 

10 Index- leg length > 

tiansveisc chest diameter > sagittal chest diameter \ tiiink height 
(cf. 46, p 415). 
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body type, but the greatest number fall into the Icptosomc and ath- 
letic groups. 

The chief diiliculty in interpreting the results of these and similar 
investigations on psychotic cases arises from an inadequate control of 
the a^e factor. Schizophrenia is more common among younger sub- 
jects, whereas older people arc more susceptible to manic-depressive 
psychoses. It is also a well-established fact, which Kretschmer himself 
recognized, that older subjects tend more to\\^ard the pyknic body 
build, younger subjects toward the leptosome. To be sure, pyknics 
may be found among young people, and leptosomes among older 
groups; and many individuals retain the same type of body build 
throughout life. But the general trend is suHiciently marked to produce 
an entirely spurious relationship between body build and psychotic 
tendencies. this reason, it is essential that age dilTcrences be ruled 
out in any comparison of the body type of diiferent psychotic groups. 

In an investigation by Garvey (14), 130 manic-depressives and 130 
schizophrenics were selected so that the two groups were closely 
matched in age. Only clear cases, classified with complete agreement 
by the hospital stall (not including the experimenter), were employed. 
When the patients were divided into heavy and slender types on the 
basis of general observation, some evidence was found for Kretschmer’s 
claims. The association, however, is reported as too slight to permit 
body type to be regarded as diagnostic of psychosis. Extensive physi- 
cal measurements were taken and several ratios between horizontal 
and vertical bodily dimensions were computed. All showed an almost 
(omplctc overlapping of the two psychotic groups. Not only were the 
averages closely similar, but also the range and the general form of the 
distribution were practically identical iu the two groups. 

Naccarati (31), in an elTort to check upon Viola's hypothesis, 
measured 100 male Italian psychoneurotics between the ages of 25 
and 40, The number of normosplanchnics is reported as being smaller 
in this group than in normal groups. The neurasthenics had a larger 
proportion of microsplanchnics (long, slender type), while macro- 
splanchnics predominated among the emotional psychoneurotics.” 
Under the latter category Naccarati included cases of hysteria, anxiety 
neuroses, and traumatic neuroses. Averages of some of the most 
significant physical measurements as well as the average age of the 
two groups are given in Table 21. It will be noted that the neuras- 
thenic group has a lower average age than the “emotional psychoneu- 
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rotics.” This might account in part for the greater tendency to micro- 
splanchny among the former. No description is given of the method 
of obtaining or diagnosing the subjects, a fact which makes interpreta- 
tion of the findings difficult. 


TABLE 21 Bodily Charactcnsiics of Individuals with Different 
Forms of Psychoneuroses 


( P'rotn X.u'i 

.11 .ill, 31, p 5 t.O 



G.oup 

Total 

of Tnnik 

Lctii^tli of 
Fxijcnntu’s' 


50 Ncur;islhcnics 456.64 

50 “Emotional psycho- 

30.43 

133.35 

32.16 

neurotics" 362.06 

37.36 

128.80 

33.94 


An extensive investigation on the relationship between body type and 
psychosis was conducted by Burchard (2). A total of 407 white male 
patients from several institutions for the insane were selected for the 
survey. Of these, 125 were clearly diagnosed as schizophrene^ by the 
hospital staff, and 125 as manic-depressives. The remaining 157 
patients manifested a variety of psychotic and neurotic conditions, and 
were employed as a control group. The subjects in all three groups 
were classified into pyknics, athletics, and leplosomcs by “general 
impression.’' Comparisons were also subsequently made in respect to 
several anthropometric measures and indices. Only seven dysplastics 
were found in the entire sampling, and these were eliminated from 
further consideration. All other subjects were retained, any interme- 
diate or mixed types being assigned to the morphological type which 
they resembled most closely. In I'ablc 22 arc given the percentages 
of pyknics, athletics, and Icptosomes found in the manic-depressive, 
schizophrenic, and control groups, respectively, when the inspectional 
method of classification was employed. 

The general trend of these figures seems to be in agreement with 
Kretschmer s theory. Not only are the greatest percentage of manic- 
depressives pyknic, and the greatest percentage of schizophrenes 
leptosome, but the control group occupies a position intermediate be- 
tween these two groups in all percentages. When the schizophrenes 
and manic-depressives are compared in terms of anthropometric 
measures, a certain amount of differentiation is also revealed. Reliable 
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TABLE 22 Per Cent of Schizophrenic, Manic-Depressive, and Control 
Subjects Showing Pyknic, Athletic, and Lepiosonie Body Types 

(I'toiii liiiuliar*!, 2 , ]>. 31) 


h'torplioloy^ical 

7 ype 

Manu - 

D( PK'WIVCS 

Schizophrenics 

( omroi 

Pyknic 

2 

36 3 

53 6 

Allilclic 

8 8 

177 

11.3 

Leplosomc 

28.0 

46.0 

33.1 


difl'crcnccs between the avcnigcs of the two groups were found in 
three out of nine physical measures and in two out of three bodily 
indices. Nevertheless, the overlapping of the groups in ail tliesc mcas- 



Fig. 71. Fro quenev Distribution of 125 Manic Dcprcssivcs and 125 
vSchizoplirenes on the Werihcimcr-Ucsketh Index ol Body Binld. (From 
Bui chard, 2. p. 47.) 

ures w^as very large. This is illustrated in Figure 71, which shows the 
frequency distiibutions on the Wcr heimcr-Ucsketh index of body 
build.’^ dhis index yielded the largest dillereuccs between the two 
groups. It is appaierit that, despite the statistically ?>igni{icant diller- 
ences in averai.’^es, schizophrenes can be found vJio arc much more 
pyknic than certain manic-depressi vcs, and vice versa. 

Lven the dil;erences in averages between the two groups may be 
the result of other uneoniroiled factor Hureluird recognized this dilli- 
eulty and undertook a detailed analysis ol his manic-depressive and 
schizophrcnc groups. In rctu'^^d to racial and national background, 
occupation, and educational status, iii'i appreciable or consistent dif- 
ferences could be discovered. In age, however, the diilereiiees w'crc 


'*Cr. (ot^lncUe la 
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very large, the average ages of schizophrenic, control, and manic- 
depressive groups being 30.97, 42.90, and 49.65 years, respectively. 
Further analysis revealed a definite relationship between age and body 
build. This factor seems to have accounted largely, although not 
entirely, for the group differences found. 

Since the age factor plays such an important part in all studies on 
constitutional type, we may examine more closely Burchard’s data on 
this problem. In Table 23 will be found the average Wertheimer- 
Hesketh indices of subjects falling in vsuccessive decades, within the 
entire sampling as well as within each psychotic group. These averages 
indicate a definite tendency toward a more pyknic body build with 
advancing age. This is manifested within each psychotic group, as well 
as in the entire group. Further corroboration of this finding is fur- 
nished by the correlation of -.256 obtained between age and index 


TABLE 23 Wcrtiiciiticr-Hcsketh Index in Relation to Aite and lype 
of Psychosis 

(T'ro'n l)UrdiaT>l, 2 , p (it) 


Ape 

Fntire (Jmup 

Mean W’cUhcimei-flcau’lh Index 

Schi/ophrencs Matuc-Depi cssives 

Contiol 

15- 19 

3or> 1 1 

297.25 

262.66 

321.00 

20-29 

275.10 

279.77 

252.00 

273.48 

30-39 

260.82 

272.00 

256.33 

253.86 

40 -49 

249.34 

2''2.50 

246.52 ^ 

249.41 

5a- 59 

25.3.68 

277.50 

247.29 

257.16 

00-69 

236.50 

243.3 3 

241.67 

228.75 


value in the entire sampling. Much of the difference observed between 
the two psychotic groups can therefore be attributed to age. It should 
be noted, however, that within each decade the sehizophrcncs have 
a higher average index than the manic-depressives. To be sure, the 
differences arc considerably reduced by ruling out age, and the control 
group no longer retains its intermediate position, but a certain differ- 
ence in the expected direction remains. This difference could possibly 
have resulted from other unsuspected factors in which the two psy- 
chotic groups may not have been equated. Or it may indicate an 
actual, although very slight, relationship between body build and type 
of psychosis. 
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CORRELATIONAL STUDIES WITH NORMAL GROUPS 

It has frequently been objected that one cannot generalize from a 
slight correspondence between body build and certain forms of insan- 
ity to a relationship betw^een structural and personality traits of normal 
individuals. The comparison of average values, furthermore, or of the 
percentage frequency of bodily types among diiYcrcnt groups may ex- 
aggerate a very slight degree oi associatjon. Such comparisons tell us 
little about individual cases. For these reasons, a number of investi- 
gators have resorted to the correlation cocllicicnt to obtain an exact 
quantitative measure of the amount of relationship within a group. 

The correlation coenieient is alTected not only by the presence or 
absence o\' clear-cut types within a group, but also by the de<^ree to 
which a given typal characteristic is exhibited. This method seems to 
rest upon a slightly diilcrent principle than that underlying group com- 
parisons. Thus if morphological index were found to correlate highly 
with intelligence, it would mean not only that the clearly micro- 
splanchnic are more intelligent than the clearly macrosplanchnic, but 
also that, within the intervening range, the more microsplanchnic the 
individual, the more intelligent he will be. A lack of relationship be- 
tween intelligence and body build within the intermediate mixed 
grcHips W'ill considerably low^r the correlation W'hich would be ob- 
tained if only ‘'pure Upes" w^cre included. 

Let us examine some facts and figures. Naccarati (30) found a 
correlation of .356 between morphologic index and intelligence test 
scores within a group of 75 college men. Jn the same study, the height- 
w'eight ratios^- of 221 college men ranging in age from 17 to 22 
correlated .230 W'lth intelligence. At erst glance, these slight positive 
correlations betw'cen heieht-weight ratio (or morphologic index) and 
intelligence would seem to support the claim that the tall, slender 
individual tends to be more intelligent. The age factor, however, must 
again be considered. Upon further statistical analysis of the data, it 
was discovered that the correlation of ,230 icsuked largely from a 
negative correlation between u c/e/zf ai.d infclli<^cnce test score w^ithin 
this group.’" I hc more heavily built, stocky individuals at the age 

‘“The height weight iLilio has frequently been substituted for the more elaborate 
morphologic index, lor the sake ot expediency, since the two indices are closely 
Jclatcd. Naccarati i30), for example, found a (.(-rrclat'on of .70 betueen the two in a 
group of 7^ students, and a co''reKi(ion of . 1 ^ m another group of 50. 

^"Subsequently computed by Hull (17, pp. 142-143), from Naccarati’s published 
data. 
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levels under consideration tend to be the older members of the group. 
Similarly, the older individuals within any one academic level are 
usually the duller ones. It therefore seems very likely that even the 
low degree of correspondence found between height-weight ratio and 
intelligence is attributable to an uncontrolled age factor and cannot be 
accepted as proof of a relationship between body type and mentality. 

In a subsequent investigation by Heidbrcder (16), Naccarati’s 
hypothesis was checked on a group of 1000 white, native-born college 
freshmen, including 500 men and 500 women. The correlations be- 
tween intelligence test scores and height-weight ratios proved to be 
only .03 for the men and .04 for the women. Similarly, the correla- 
tions between height-weight ratios and scores on each of the five sub- 
tests of the intelligence examination closely approximated zero, rang- 
ing from -".07 to t'.l^k 

In an effort to discover whether the use of the more elaborate mor- 
phologic index in place of the height-weight ratio might yield more 
positive evidence for Naccarali’s view, Sheldon (36) conducted an 
intensive investigation on 434 freshman men, between the ages of 
17 and 22. Twelve measurements were carefully made on e;ieh indi- 
vidual and from them was computed the morphologic index, in the 
manner described by Naccarati. The c(>rrclation between these indices 
and scores on a common group jntelligencc test for college freshmen 
was .14. Correlations of the morphologic index with each of the nine 
sub-tests in the cxaminrlion ranged from - ,02 to 1.12. These findings 
corroborate closely tliose obtained by Heidbrcder with the height- 
weight ratio. 

In a rurther investigation of morphologic types, Sheldon (37) cor- 
related morphologic index and ratings on five personality traits wdthin 
a group ('f 155 freshman men. Hach student was rated by five upper- 
classmen wiio belonged to the same fraternity as the subject. The 
judges had thus had considerable opportunity to observe the studenfs 
everyday behavior in many situations and were fairly well qualified 
to rate him. Ihc consensus of all five judges was taken as the final 
rating for each individual. Below' will be found the correlations be- 
tween morphologic index and ratings on each trait: 


Emotional excitability 

.00 

Aggie.-iSivcncss 

-.08 

l^cadcrship 

-.14 

Sociability 

— .22 

Pcrsevci a nee 

!oi 
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On the whole, these correlations are too low to indicate an appreci- 
able degree of relationship between bodily type and personality traits. 
The correlations of morphologic index with leadership and sociability 
arc, however, suggestive. 1 hese two correlations indicate a tendency 
for the more heavily built individual to be more sociable and more of 
a leader. 1 his could again result in part friuii an uncontrolled ane 
factor, inasmuch as the older individuals within such a group might 
well be expected to manifest these characteristics. 

A comprehensive investigation including both intellectual and per- 
sonality trails was conducted by Garrett and Kellogg (13). llie sub- 
jects were again male college freshmen. Morphologic indices were 
ccanputcd with measurements taken from three standard {)hotographs 
of each subject. These photographs, taken in connection with gym- 
nasium routine, showed three diifcrcnt views of the individual in the 
nude. The morphologic indices computed from the phwhagraphs corre- 
lated .81 with height-weigiU ratios obtained from direct measurements 
on 219 students. On this basis, the authors felt justitied in their use 
of the photographs for the sake of expediency. 1 ho ^'photographic" 
morphologic indices, as w^ell as the height-weight ratios from direct 

TABLE 24 CorrelarioHs of Itci^hl-Wci^ht Ratio and Mo*‘ {dialogic 

I'ldcK with iesi Scons 


( i ’'will < i.n i t I ! ,.11' i K < I'''!--.’, i C i>. 1- 1 * 


SIk'i If'dcx 
{ ho/ii ( 'h>((}*uaph \ ) 

II ( H'cipjii Ratio 
( funn dm 1 1 mt UMoc/nt'nts) 

Test Niinilici 

Oi t 

Correl . ion 

Niinilu.-r 
of C 

Correlation 

1 ' ho rnci i kc I n t c I i i iz cnc e 




test 20h 

07 

204 

.10 

WooUworth Shed 15! 

05 

!50 

.09 

Social hi Iciligcneu l est 123 

-.00 

122 

.05 


measurements, were correlated wdth ti vis of intelligence, emotional 
stability, and social aptitude, with the results showm in Table 24. 
None of these correlation? is suHiciently large to indicate a significant 
degree of relationship. Hiiis we must conclude that none of these 
correlational studies on un^elected noimal samplings has provided 
any support for constitutional typology. 
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THE STUDY OF “PURE TYPES” 

Exponents of typology have been quick to object that the low, neg- 
ligible correlations in unselected samplings could result from the pres- 
ence of a large group of individuals of mixed types, in whom no 
consistent relationship between physique and personality may exist. 
These individuals, who arc probably in the majority, would serve to 
“dilute" any clear-cut relationships among the '‘pure types.” It has 
also been argued that even when indices arc employed in lieu of iso- 
lated dimensions, the investigator is not getting a picture of the indi- 
vidual's physique in its totality. And the latter is essential in any con- 
cept of constitutional types. 

Most of the numerous German investigations on types have pro- 
ceeded by sclcctin*^ good specimens of each type on the basis of physi- 
cal measurements or observations and then administering a variety of 
psychological tests to the groups so obtained. By this method, for 
example, the conclusions were reached that pyknics arc more Jis- 
tractablc than Icptosomes, that they have a greater perception span, 
show a better incidental memory, respond “synthetically” rather th;m 
“analytically” to a diflicult perception, are more sensitive to colors 
than to forms, arc superior in motor tasks except when these require 
delicacy of movement, and give more extroverted responses. These 
arc among the major diilercnces reported by Cierman investigators.^ ‘ 
These writers plaec relatively little stress upon differences in general 
intelligence between the types. 

Many of these studies arc open to serious criticism and it is there- 
fore diflicult to CN'aluale their findings. The groups employed were 
usually small. Averages were reported with no indication of varia- 
bility within each group or of amount of overlapping between groups. 
Ouantilali\c data were frequently lacking and only descriptive obser- 
vations reported. The tests were often inadequate or poorly standard- 
ized. The groups themselves, selected chiefly on the basis of physical 
type, frequently differed in other essential respects. Thus the relative 
proportion of men and women may not have been constant in all tlic 
groups. Or the pyknics may have been older than the leptosornes, in 
which case this age difference could account for the observed psycho- 
logical differences. Little or no attempt was made to contiol this age 
factor, in some studies the subjects ranging from adolescents to sexage- 

^■^For a survey of many nf these investigations, see Klinebeig ct al (20). 
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narians. Social and cultural background may also have affected the 
results. There is some evidence, for example, that leptosomes are 
found more commonly among the higher social and educational levels. 
Since there are also intellectual and possibly emotional differences 
from one socio-economic or educational level to another, such factors 
should ‘be held constant. 

In America, Mohr and Gundlach (29) conducted an intensive 
quantitative investigation on a group of male convicts in a state prison. 
A total of 600 men were measured, out of which 89 w'crc selected as 
good representatives of Icpiosome, athletic, and p\knic types. In arriv- 
ing at this classification, the investigators employed all the anthro- 
pu'inetric measures suggested by Kretschmer, as well as a general 
observational diagnosis of body type. Each subject wa:, then given the 
Army Alpha and about a dozen simple psychological tests ^ug<tcsted 
by the German workers as diagnostic of constitutional type. Included 
were such tests as speed of tapping and of writing, visual reaction 
time, cancellation, substitution, color fusion, and Rorschach inkblots. 

A striking difference in average Alpha score was found among the 
three groups. This is shown in I'able 25, together with the number of 
cases in each group and the average ages. The correlation between 

TABLE 25 Mean Intelligence Test Scoies of Adult Leptosomes, 

Athletics, and Pyknics 


( tiotn Mohr Mai <a!ucnuh, J'*. j'l- IWJ 



Body Type 

Narnhcj of Lascs 

Mean Ai'C 

M can 

Alpha Score 


Leptosome 

19 

28 55 

96.5 


Athletic 

26 

28.65 

79.2 

\ 

L. 

Pyknic 

44 

34.75 

51.9 


Alpha score and an index of body build was found to be -.34, which 
further corroborates the above results. Although not very high, this 
correlation indicates a significant tend ’ricy for the tall, slender indi- 
viduals to obtain higher scores. Similarly, in many of the other tests 
the differences among the groups were large enough to be statistically 
significant. It will be noted, however, that there is a marked difference 
in age among the three groups, the pyknics being on the average a 
fittle over six years older than the leptosomes or athletics. In view of 
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the tendency for Alpha scores to decrease with age (cf. Ch. 9), 
therefore, the pyknic group would be expected to obtain lower scores. 
The cultural and racial composition of the three groups is not stated, 
and this factor may also account for some of the observed differences 
in test performance. 

In a later study, Klincbcrg, Asch, and Block (20) undertook to 
compare Kretschmer’s body types under more rigidly controlled con- 



FiS. 72. Distribution of Scores ol Teptosomes and Pvknics on the Pignet 
Index. (I’lorn Klineberg, Asch, and Block, 20, p. ISO.) 

ditions. The study was limited e'xclusivcly to college students, so that 
variations in age and in social and educational level were markedly 
reduced. T he subjects were also very homogeneous in racial and cul- 
tural background. From a group of 153 men in a single college, aver- 
aging 19 years-9 months in age, it Wcis possible to select 56 “pure 
pyknies*’ and 59 “pure leplosorncs.” The classification of body type 
was based upon five indices computed from physical measurements, 
together with the experimenter’s observational diagnosis. That the 
two chosen groups were clearly dilTercntiatcd in physique is illustrated 
by Figure 72. This sliows the distributions of the pyknic and Icpto- 
somc groups in Pignet Index,^^ one of the five criteria of selection 
employed. It will be noted that overlapping is virtually absent. 

In sharp contrast to this distribution is that reproduced in Fig- 
ure 73, showing the scores of leptosomes and pyknics on one of the 

^'“'Pignet Index ~ Height - (weight 4 chest circumference). 

Noil: This formula is printed incorrectly in the study under consideration (20, 
p. 164). We assume this was a mispiint, and that the correct formula was employed 
in the computations. 
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psychological tests, viz., cancellation of letters. In this case, the two 
groups overlap almost completely. Similar results were obtained with 
all the other tests, which included tests of intelligence and of emotional 
adjustment, as well as six tests specifically designed to measure alleged 
characteristics of the two opposed constitutional types. In no case 
were the dilicrcnces between the two groups statistically significant. 
Correlation of measures on 1 10 cases conhnned these findings. Ihc 



I''ig. 73. Di>lnbuliop. of ol l.cptosonios and l^yLnics in Lcttci 

C'anccilatiun. (from Klincheig, Asch, and block. 20, p. ISO.) 

cc^rrelations between pliysieii] indices and test scores were all close 
to zerc>. IntercorrelalHMi of the vaiious psychological tests were also 
negligible. If the unnVrKing conformity implied by type theories were 
pjcscnt, a fairly close correspemOL nee shiHild ha\e been found among 
the \aiious diagnoddc tots. ViCvvcd from aav angle, the results are 
c f > mp le te ly negative.^'’ 

An intensive mxestigation of persm aJity trails in relation to physi- 
cal type was conducted b\ Klineberg, f jeld, and Foley (21). The 

’'* \ pui.illi‘1 s(iicl> A\,i- coiulijctcd on wfunen stiiULMits. but the lesiilts were 
less cnucliisis siiKi' .ill the \M‘nieu Jell wthin the l''j-5ti)siime Kint^e and no j^jenuinc 
pvkn<v.s could be (oniid r •unnji ,^oI 1 ^ v.iihiu iIk leptosome gioup, bt'di liv coircla- 
I'op.il uud bv conU..sied liump nielbods, con oboialed the tuidin‘..s on tlic male 
Miidciits. 
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subjects were again students, selected from several colleges in New 
York City and its environs. A total of 200 men and 229 women were 
examined. Within each of these groups, the subjects who fell in the 
upper and those who fell in the lower 25% of the distribution of 
Pignct Index were selected as leptosomes and pyknics, respectively. 
This gave 50 leptosomes and 50 pyknics among the men, and 57 lepto- 
somes and 57 pyknics among the women. These contrasted physical 
types also showed significant differences in nearly all other physical 
measures and indices obtained in the study, and can safely be regarded 
as distinctly different in physique. Little or no overlapping was found 
in any of these measures. 

As for age, the male pyknics proved to be slightly older than the 
leptosomes, their average ages being 21.08 and 20.17, respectively. 
Besides being very slight, this age difference is such as to exaggerate 
any of the alleged psychological differences between leptosomes and 
pyknics. Hence such an age discrepancy loads the dice slightly in 
favor of Kretschmer's hypothesis and would make negative findings 
all the more conclusive. Among the women, the age difl'erence was 
negligible, the leptosomes averaging 19.73 and the pyknicj 19.23 
years. 

All subjects were given the Bernreuter Personality Inventory, the 
Allport- Vernon Study of Values, and a specially designed test of sug- 
gestibility. A large portion of the group also took two other tests. One 
of these was an honesty test, showing the number of times the subject 
cheated on what seemed to be an information test (Mailer Test of 
Sports and Hobbies), The other was a specially constructed persist- 
ence test which measured the length of time the individual worked on 
an insoluble finger maze before giving up. In Table 26 will be found 
the average scores of both male and female leptosomes and pyknics on 
each test, together walh data on the significance of the differences be- 
tween the averages. 

The results of this carefully controlled study are clearly negative as 
regards type theories. None of the leptosome-pyknic differences, in 
cither male or female group, is significant. In other words, all the ob- 
tained differences could have resulted from chance errors of sampling. 
It should also be observed that in several comparisons the differences 
between leptosomes and pyknics were contrary to expectation. For 
example, the male leptosome group appears more “sociable” on the 
Bernreuter and seems to have a higher sense of “social value” accord- 



TABLE 26 Avcfaqc Scous nt Lcpiosonie iiud Pyknic Groups on Personality Tests 
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Not all subjects were given these tests. 
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ing to the Allport-Vernon scale than does the male pyknic group. 
The average scores of all the groups, furthermore, fell very close to 
the norms for college men and women in general. A final point to note 
is that the ranges of the leptosorne and pyknic groups were nearly 
identical, showing an almost complete overlapping of distributions on 
all personality tests. 

Ttin SEARCH rOR COMPONENTS 01 PHYSIQUE 
AND TEMPERAMENT 

The approach illustrated by the recent contributions of Sheldon and 
his co-workers (39, 40) emphasizes the discovery of basic compo- 
nents of ‘‘physique" and “temperament." This is regarded as a neces- 
sary first step in any investigation of the relationship between struc- 
tural and behavioral characteristics. In Sheldon's schema for classi- 
fying plnsiqucs, llie individual is rated on a 7-point scale in each of 
the three following components: 

( 1 ) Ijulomoi phv — the degree to which “soft roundness'’ pred^nninates. 
A peison lated high (at or near 7) in endomorphy is llahl^y, solt, 
and rolypoh. In such a physique, the digestive viscera are over- 
de^ eloped in relation to other body systems, and the individual has 
a relatively iaige abdominal and thoracic cavity. 

(2) phy — the degree to which bone and muscle predominate. 
1 lie pi\>lcssional “strong man” of the circus would usiuiliy rate 7 
in this component. I'hc distinguishing mark ol mesoinoiphy is 
uprightness and sturdiness of slructuic. 

(3) / emnuirphv — the dcgice U> which linearity and fragilitv predomi- 
nate. dhe extreme ectomorph is “skinny," with long, delicate 
bones and underdeveloped muscles. 

Each individual’s “somatotype" consists of three numbers, repre- 
senting his rating in endomorphy, mesomorphy, and cclomorpby, re- 
spectively. riius a 7 1-1 represents extreme endomorphy. A 2-6-2 
and a 3-6-2 arc both highly mesomorphic, hut the latter sliows mure 
endomorphy than the former. Theoretically, there are 210 somatolypc 
combinations which could be obtained with three components rated 
on a 7-point scale. But some of these combinations arc obviously 
impossible, such as the hypothetical 7 7-7 or 11-1. Sheldon (39) 
describes 76 somatotypes which have been actually observed. In Fig- 
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urc 74 will be found photographs of four somalotypcs, illustrating the 
extremes of endomorphy (7-1-1 Vi), mesomorphy (1-7-1 Vi), and 
ectomorphy ( 1 Vi-l Vi-7), and one physique in w’hich the three com- 
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Fig. 74. Human Somaloi>pcs. (liom Sheldon and Stevens, 39, pp. 8-9.) 


ponents arc nearly in balance (4~3Vi-4). The most common physique 
among the college men who were Sheldon's principal subjects was 
3-4-4. 

One of the principal aJ'. antages of this system of classification over 
earlier constitutional typologies is that it clearly begins with the as- 
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sumption of a continuous distribution, and merely describes the indi- 
vidual in terms of three components or variables rather than one. The 
use of a 7-point scale is of course arbitrary and only a matter of con- 
venience. Five, ten, or any other number of steps could be substituted, 
in which case the total number of somatotypes would be decreased or 
increased. A further advantage which the authors claim for this sys- 
tem is that nutritional and age differences do not alter the individual's 
somatotype. For example, they maintain that although most endo- 
morphs are "‘fat,” loss of weight will not change endomorphs into 
mesomorphs or ectomorphs — “they become simply emaciated endo- 
morphs'’ (39, p. 8). The reason for this consistency of somatotype is 
that several measurements arc taken in parts of the body which change 
little when llcsh is added or lost. In soinatotyping an individual, the 
three components are rated in at least five different bodily regions and 
then averaged. For example, the separate ratings of the 7-1 -Hi 
pictured in Figure 74a were 7-1-2, 7-1-1, 7-1-2, 7-1-2, 7-1 -1. 

The three components of physique were chosen and defined through 
detailed observations of the photographs of 4000 college men. As 
shown in Figure 74, each person was photographed in the ry.ide and 
in a standardized posture, from the frontal, lateral, and dorsal posi- 
tions. Once the three components had been suggested by this inspec- 
tional analysis, suitable anthropometric measures were selected by 
trial and error. The measurements were made directly on the photo- 
graphs with nccdle-poiiil dividers. 

A paralkl proecdure was follow'cd in arriving at the basic compo- 
nents of temperament. First, the authors assembled a list of 650 
alleged temperamental “traits’* described in the literature, most of 
them being related to introvcrsion-extrovcision. After adding a few 
from their own observations, arranging, and condensing, they were 
able to reduce the list to 50 terms which seemed to embody all the 
essential characteristics. A group of 33 young men, mostly graduate 
students and instructors, were then rated on a 7~poinl scale for each of 
these 50 traits. Ihc ratings were based upon a scries of 20 intensive 
mte^vicw^s by the experimenter, extending over a period of one year 
and supplemented by everyday observations. All the 1225 intcrcorre- 
lations among the ratings of the 50 traits were computed. An inspec- 
tion of this correlation table suggested to the authors that the traits 
fell into three principal “clusters,” such that the tests within each 
cluster were positively correlated with each other and negatively cor- 
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related with the tests in the other clusters. At this point it was decided 
to keep only those trails, or items, which had a positive correlation of 
.60 or more with other items within their cluster and a negative corre- 
lation of .30 or more with items outside the cluster. On this basis 22 
of the original 50 trails were retained. In the course of subsequent 
studies on more subjects, the investigators undcrlook to sharpen and 
redefine the initial 22 traits and to add others which also satisfied the 
above correlational criterion. 'The fina* scale developed by this tech- 
nique consisted of 60 traits, 20 in each cluster. 

The tcinperamcntal components represented by each of these three 
clusters were described as follow^s: 

(1) Vi:.iCcroionici — tendency toward ickiKation, ]'.)vc of comfort, socia- 
bility. conviviality, plcasuie in Cuting and in digesting, indiscrimi- 
nate amiabditv, complacent toieraucc. easy emotional e\pression, 
and need of people when troubled. 

(2) Somatotonia — tendency toward assertiveness in posture and 
movement, cneigetic activity, love of power and risk, physical 
courage, diicctnoss ol manner, psychological callousness, general 
noisiness, and need of action vvhen tioubled. Individuals high in 
this trait are charactei i/ed by ‘vigor and push.” 

(3) C’cn‘/>r<;//>.e/i/--tcndcncy toward restraint and tightness in posture 
and movement, love of privacy, fear of people, emotional restraint, 
poor sleep hah its, and need of solitude when troubled. 

How^ arc the components of physique related to the components of 
temperament in this systen ' Siieldtm maintains that endomorphy 
tends to be associated w'llh viscerolonia, mesomorphy with somato- 
tonia, and ectornorphy with ccrcbrotc* .ia. In a group of 200 university 
men between the ages of 1 7 and 3 1, t*L.>crvc(l by Sheldon over a 5-year 
period, the following conclations were found betw'ccn ratings for the 
individual components of physique and temperament: 

I naomorpliv and visccioUniia -7‘> 

Mcsoinoiphy and sonialolonia .82 
bctoniorphy and cereHrotonia .83 

From a further analysis of the same M.ojccts, the authors suggest lire 
hypothesis that certain discrcpamics between somalotype and tem- 
peramental index may predispose the individual to maladjustment and 
interfere v/ith his achievement (39, Ch VII). 

The correlations between structural and temperamental compo- 
nents reported by Sheldon are certainly' much higher than those found 



450 Differential Psychology 


heretofore. Sheldon and his co-workers attribute this dihercnce to 
their own reliance upon ‘'essential underlying components” of both 
physique and temperament, in place of what they regard as the rela- 
tively superficial or fragmentary measures of earlier investigators. 
Sheldon argues, for example, that aptitude or personality tests may 
not reveal the "deeper and more enduring aspects” of temperament 
which he claims to have reached through his series of interviews 
(7, p. 33). For this reason, test scores might not yield such high 
correlations with somaiolvpe as were found by Sheldon through the 
use of ratings. A counter-argumenl is that the well-known “halo 
cllecl" may have operated in the ratings, producing artificially high 
correspondences between physi(}ue and temperamental index. As the 
strongest defense against the halo cfTect, Sheldon otfers the fact that 
the experimenter was aware oi its nature and was therefore on guard 
against it. The ehectiveness ol such a safeguard is of course debatable. 

Subsequent checks on Sheldon's theory have sho\ui that when 
objective test scores were subslituled for personality lalings. the corre- 
lations between somatotype and beha\i('>r characteristics dropped to 
the familiar low values. In a study by Child and Sheldon (7) on groups 
of Harvard men,'” somatotype was c(>rrelated with tests of verbal and 
numerical aptitude, ascendance-submission, and masculinity-feminin- 
ity. These correlations were uniformly low, the highest being .21. Only 
a few were of marginal signihcance, indicating a fair probability of a 
true but very slight relationship, f’iske ( 1 H found no significant asso- 
ciation bct\^ecn wSheldon's somatotype classification and a series of 
intelligence and personality tests among 133 13- to 17~year-old boys. 
In this study, ratings of personality characteristics also tailed to show 
a significant relationship to body build. 

Another serious criticism against the system proposed by Sheldon 
concerns the original identification of the three tempcrii mental com- 
ponents (cf. 1 ). In the last analysis, the entire structure ol evidence 
for the presence of tiicsc particular components stands or ialls witli 
the adequacy of the initial experiment on 33 Harvard men. To be 
sure, subsequent studies were conducted on larger groups. T3ul these 
studies W'crc designed simply to redefine, sharpen, and expand the 
originally chosen list of 22 “traits” for measuring the three tempera- 
mental components, rather than to check the adequacy ol the compo- 

The number of .subjects used for obtuiiiine; the difTcient eorrchilions varied from 
90 to 518. 
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nents themselves. 1 his is clearly indicated by the authors’ procedure. 
The criterion for adding a new trait to the list was that the trait must 
correlate highly and positively with the trails in one of the original 
clusters, and negatively w'ith the traits in tlie other tv\o clusters. The 
subsc(iucnt modilication or addition of traits thus depended in a very 
intimate waiy upon the results of the initial experiment. The small 
number and highly unrepresentative nature of the subjects employed 
in this initial experiment make it ill-suited to play such a fundamen- 
tal part in the development of the entire sehema of temperament 
classilieation. 

Finally, the technique of identifying components by inspection of 
a correlation table leaves loo much to subjective judgment. In so far 
as the major contribution of Sheldon's approach is its emphasis upon 
components rather than types, the best available objective tLclinic[ucs 
for identilying such comptments ought to be applied. '\ hesc tech- 
niques, know'n collectively as ‘Tactor analysis/' are based upon fur- 
ther statistical analysis of the table of intercorrelalions, and will be 
discussed more fully in Chapter 15, In the present connection it will 
siilhce to note that other investigators have begun to employ these 
techniques in the analysis of body build as well as personality charac- 
teristics, with results which oiler little support to SlieldoiCs tripar- 
tite classilieation. 

Fyril Burt (3, 4, 5), who first applied factor analysis to body build, 
has studied several groups difl'ering in age, sex, and national back- 
ground. The largest of these .amiplings cimslsled v)f 2400 British air- 
men in the KAF. Factor analyses were conducted on as many as 17 
bodily dimensions, although >omo f Ihe studies empk>yed fewer 
measures. Such investigations by Burt and his sludeiits have indicated 
the presence of a general ‘'size’’ fact(^r. present to some degree in all 
bodily measurements and varying independently of type of physique. 
A second, bipolar or “type” factor was also found, which appeared to 
be related to the tendenc}' toward a linear )i a broad body build. 
This factor was positively related to measures of length, and nega- 
tively related to breadth, depth, and e.icumfercntial measures.^" An 
index of body build based upon this second, bipolar, factor was found 


In a nioic dclculcd and Tcfincd factor anrd sis of some of these data, Thuistone 
(44) identdied seven dillcrtiil b.c.ors. but these otlei no rnuic si pport to the Sheldon 
elassitieation than do the oruunal tactois identified b> Burt and his students, bor 
other applications of (actor analysis to body budd, cf. 3, 4. 5, 15, 33, 34, 43. 
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to be normally distributed and to exhibit no tendency toward distinct 
types. Preliminary exploration of the relationship between this index 
and personality characteristics in both normal and abnormal groups 
showed a very slight but consistent trend along the lines suggested by 
Kretschmer and others (3, 34). Thus the “long-lean” person tended 
to be more inhibited and repressed, and more prone to schizophrenia 
and allied disorders; the more “rotund” individual inclined more to- 
ward cheerful, sociable reactions and toward cyclical forms of insanity. 
But again it should be repeated that these relationships were so slight 
as to be of theoretical interest only. 

CONSTITUTIONAL TYPE OR SOC lAL STLRTOTYPi:? 

Although too low' to be of practical diagnostic value, the correlations 
between body build and certain personality characteristics seem to be 
sufficiently consistent to merit some consideration. Do these correla- 
tions mean that common, underlying “constitutional factors'" exert at 
least a modicum of inffuence upon the development of both behavior 
and physique? Perhaps, but there is also another explanation. The 
differences in peisonality may have developed as a response to the 
differential treatment accorded to individuals of different physitjue by 
their social environment (cf. 6, 7). The existence of social stereotypes 
creates a vicious circle which tends to perpetuate wliatevcr beliefs may 
be current regarding the association of physical and psychological 
traits. 

Some of the effects of the individuars physique upon his social 
environment may be more direct and need not imply social stereo- 
types. This is cs])ccially true in childhood, when physical size, mus- 
cular prowess and strength, agility, and other characteristics associated 
with physique may inffuence social interactions and status. It has been 
repeatedly showm, for example, that adolescent leaders tend to be 
taller and heavier than their associates (cf. 8, pp. 248-249). More- 
over, high school leaders are more libel) to become leaders in college 
than those who wxtc not leaders in high school, and the same indi- 
viduals are also more likely to be community leaders after leaving 
school (9). It will be recalled, also, that in a previously cited study on 
fraternity men (cf. p. 438), the highest correlations found with mor- 
phological index — and the only ones approximating statistical signifi- 
cance — were those of ratings for leadership and sociability. These 



453 


The Quest for Constitutional Types 

correlations showed a tendency for the more heavily built man to be 
regarded as more sociable and more of a leader by his fraternity 
brothers. In the more recent study by Child and Sheldon (7), dis- 
cussed in the preceding section, one of the highest correlations was 
between dominance scores and mesomorphy. High ratings in meso- 
morphy indicate a tendency toward overdevelopment of muscle and 
bone, suggesting the sturdy, athletic physique. 

In an investigation designed to test the Kretschmer theory, Cabot 
(6) obtained an extensive series of test scores, ratings, and interview 
data on 212 high school boys. Within this group, 9 boys were clearly 
and consistently classified as pyknics, 25 as athletics, and 28 as lep- 
tosomes. The principal comparisons were made among these three 
groups. An examination of the personality measures showed that the 
largest dilferenccs occurred between the athletics, on the one hand, 
and the pyknics and leptosomcs, on the other. The athletics tended to 
be more dominant and extroverted, and more active as social leaders. 
They were also rated higher in creativencss, imagination, responsibil- 
ity, and influence on their associates. On this basis, Cabot proposed a 
theory of ‘’socio-biological advantage,'’ according to which a ‘‘good” 
physique (e.g., athletic or mesomorphic) gives the individual certain 
advantages in his social environment. It should be noted, in further 
explanation of Cabot's results, that a fairly signilicanl diticrence in 
socio-economic level also favored the athletic group in his sampling. 

In summary, then, the slight correlations found between body build 
and psychological characteristics in various studies are of the order 
of <ill other structural-behaviornl correlations reported in the preced- 
ing chapter. And they are subject to the same variety of explanations. 
We have seen that, beginning with the sharply differentiated classifi- 
cations of traditional type theories, typologists progressed through 
the “eonstitutional” concepts of relationships between structural and 
behavioral patterns, to the underlying question of the organization of 
all traits within the individual. It is to this basic question that we shall 
direct ourselves in the two chapters which follow. 




CHAPTER 

14 


Vartahilthf ici thin 
the Individual 


The study of variatkws from trait to trait within the individual is 
of both practical importance and iheorclical significance. When a 
child is classified as intclleclually inferior on the basis of, let us say, 
Stanford-Binet IQ, there is still much that remains to be known about 
his abilities. Is he equally inferior in all respects, or docs he exhibit 
significant discrepancies in his mental development? Is he normal or 
even superior along some specific lines? Similarly, in the case of a 
child of very high IQ, we may inquire in what ways he is superior. 
How uniformly does he excel the average child in intellectual per- 
formance? The intelligence test, furnishing a single summary figure 
to characterize the child's general mental ie\el, often obscures impor- 
tant facts. '1 wo individuals obtaining the same total score may present 
very diflercnt "mental pictures” when their performance along specific 
lines is analyzed. 

The experienced psychologist has always taken this into considera- 
tion in interpreting test scores. The child's performance on the dif- 
ferent pa.rts of an inlelligci.ee scale iind even, when feasible, on 
several diflercnt kinds of tests, is carefully analyzed before a final 
judgment is ofiered. There is a grow'l g realizati(m, however, that the 
question of variation among the individuafs abfutics deserves serious 
and systematic consideration and should be investigated in i^s own 
right, riiis problem is gradtially coming to be regarded as even more 
important than the establishment of the individual’s general level of 
performance. 

In planning an educational progi.o-. for a given individual, in 
helping him to choose a vocation, or in evaluating his qualifications 
for a job in industry, it is of the greatest importance to know his 
strong and his weak points, his particular assets and liabilities. Total 
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scores on intelligence tests can be used only in a crude and general 
sort of educational and vocational guidance. In the comparative study 
of groups, such as the sexes or different racial or cultural groups, a 
consideration of the general level of ability may also prove misleading. 
Let us suppose, for example, that one such group excels markedly in 
ability A and the other in ability B. Jf both are examined with an 
intelligence test which samples abilities A and B to an equal extent, 
no difference in total score will appear between the two groups. Lssen- 
tial and large differences might thus be couvcealed by the practice of 
lumping a number of tasks indiscriminately in the effort to arrive at 
the general mental average called "intelligence.” 

Much confusion has likewise been introduced into the interpreta- 
tion of test results by the common tendency to accupt labels loo lit- 
erally. Thus, if two tests arc labeled measures of '"intelligence,” it is 
often incorrectly assumed that they are measuring the same character- 
istic of the individual. It is therefore most disconceriing to discover 
that the same child may appear dull on one intelligence test and 
above average on another. Such cases are, however, found. Since intel- 
ligence scales consist of a more or less random sampling of different 
tasks, the specific abilities covered by the various tesU may differ. 
Some tests, for example, may be more hea>'ily "ffoadeci” wilTi verbal 
material, others with spatial material. EA^en successive levels of the 
same test occasionally involve different abilities. Thus the Slanford- 
Binet draws more heavily from (he verbal field at the highci year kvels 
than it docs at the lower. Ihc same child tested with the Stanford- 
Binel ai different ages might be fa\'ored at one lime and handicapped 
at another because of the particular abilities called into play. 

if the individual's abilities were all more or less on a dead level, a 
single summary score would be quite informative. But if appreciable 
variation in the individual's standing in different traits is the rule, then 
such a. score is crude at best and may upon occasion be definitely 
misleading. It is essential, therefore, to inquire into the extent of 
variation within the individual. The data on this question have been 
gathered from a variety ol sources. Individuals w'ho exhibit marked 
asymmeiry of development along different lines have been studied. 
Such individuals can be found among the feebleminded and the intel- 
lectually superior, as well as among the normal. Ouantilativc measure- 
ments have also been made on the extent of variability from trait to 
trait in large random samplings. Relevant data have likewise been 
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provided by correlational studies. An examination of the intercorre- 
lations of scores on various tests indicates the degree of correspond- 
ence between the individual’s performance along different lines. 


THE PROFILE ANALVSIS 

In the effort to obtain a more objective and concrete picture of varia- 
tions within the individual than is furnished by tlie general impression 
of the examiner, a psycho<^raph, or profile (hart, of the individual may 
be drawn up. I he psychograph shows at a glance the relative stand- 
ing of the subject on any number of tests or other measures. The 
individual's scores on all tests must first be transmuted into com- 
parable units. Ihis is the fundamental step in any attempt to study 
variations within the individual. The psychograph itself, in the sense 
of a pictorial represent ition, could easily be dispensed with. The same 
information, although in a less vivid form, could be got from an 
examination of a set of scores obtained by the individual, provided 
that all scores are expressed in the same teims. It is in this latter 
respect that the judgment of tlie examiner needs to be supplemented 
by quantitative techniques. Confronted with a sel of scores, some of 
which are expressed in seconds, others as number of words lecallcd, 
and still others as number of problems correctly soKed, the examiner 
can tell little or nothing about the mdividuars relative standing in 
different functions. 

Scores in dilferent tests can be made comparable by the use of any 
of the commofi types of non cs described in CdiapVT 2. If all the tests 
have been standardized in terms of age, each test score can be ex- 
pressed as a menial ngc. For purposes of pixn'ile analysis, it is not nec- 
essary to conxert the MA's to KJ's, ^ince the ('A is constant for any 
one individual. The use oi age norms, however, is not always feasible. 
Some tests, especially in the held of personality, fail to show' large or 
systematic age changes. In such ca>es, the range of variation within 
a single age group may be greater than the hugest difference between 
the averages of dilTcrent age groups The application of the mental 
age concept to adults is also a questionable practice. 

A more generally applicable technique is to convert the individ- 
ual’s score on each test into a perceniile. The percentile, it wall be 
recalled, indicates what per cent of persons are surpassed by the given 
individual (cf. Ch. 2). I hus if the subject falls at the 79th percentile 
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in arithmetic reasoning, wc know that in this respect he excels a little 
more than three-fourths of the standardization group. Were the same 
individual to receive a percentile score of 34 in a vocabulary test, we 
would conclude that his performance in this function is considerably 
inferior to that in arithmetic, since only 34% of the standardization 
group falls below his vocabulary score. The results of different tests 
can also be made comparable by the use of standard scores (cf. 
Ch. 2). In this case, the subjed’s score is expressed as a deviation 
above or below the average of the standardization group, the unit 
being the standard deviation (SD) of the same group. Thus if his raw 
score falls exactly at the average, he receives a standard score of zero. 
A standard score of | 1.00 signifies that the subject excels the average 
by one SD, and a standard score of —.5, that he falls short of the 
average by one-half of the SD. 

Jt should be borne in mind that none of these techniques for con- 
verting scores imo comparable measures yields a scale of equal units. 
They simply express, in terms which are more or less intelligible, the 
relative position of the individual in dilferent tests, but they do not 
furnish a precise statement of the actual anioiini of trait diifercnce 
represented by the various scores. Thus it will be recalled^ that the 
mental age unit corresponds to the average change in score occurring 
during a one-year period. Successive mental ages will not, therefore, 
represent equal increments of ability. Wc know that an MA of 6 
indicates a higher standing lliafi an jVIA of 5, and that an MA of 10 
indicates a higher standing than one of 9; but wc cannot conclude 
that the amcnint (^f difference is the same in both instances. Moreover, 
a change from a mental age of 5 to a meiitul age of 6 docs not rep- 
resent the same amount of improvement on two dilferent tests, unless 
scores on the two tests show identical progress with age. 

Nor can percentile scores be interpreted as equal ability units. As 
was shown in Chapter 3, such an interpretation would imply that the 
distribution of the trait measured is rectangular. Since, however, the 
distributions actually obtained conform more nearly to tiic normal 
bell-shaped curve, individuals will cluster more closely at the center 
of the distribution and scatter as the extremes are approached. Con- 
sequently, a difference of one percentile point at the extremes corre- 
sponds to a much greater difference in amount of the trait than does 
a difference of one pcrcenlile point ncaier the center. The difference 
between an individual who receives a percentile rating of 90 in 
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height and one who receives a percentile rating of 91 is much greater, 
in actual inches, than the difference between two individuals receiving 
percentile ratings of 50 and 51. 

Similarly, standard t^corcs do not represent equal units. By sub- 
tracting a constant (the average) and dividing by a constant (the 
SD), we do not alter the scores in any essential way. The set of nieas- 



Fig. 75. Profile of a Five- Year-Old C'hild on the Thurstone Tests of 
Primary Mental Abilities. (Unpubl. data by courtesy of Miss Sonia 
Avakian.) 


urcs is simply transmuted into a different system of expression, as 
when pounds are changecl to kilograms. The standard scores so 
obtained retain any inaccuracies or inequalities which were present 
in the original measures.^ 

Once the individual's scores on di'^crent tests ha^e been expressed 
in the same terms, his piohle can be plotted. A number of such pro- 
files, obtained in a wide variety of testing situations, are illustrated in 
Figures 75 to 82. Figure 75 is a mental-age profile showing the rela- 
tive standing of a 5-year-old kindergarten child in a number of func- 
tions commonly included in intelligence tests and measured by the 
Thurstone Tests of Primary Mental Abilities (47). This particular 
child is of about average intelligence, as indicated by her composite 

^ Only Vvlien scores on all the tests under considciation arc normally distributed 
will standard scores represent equal amounts c.f ability in the dilTercnt tests. Under 
these circumstances, the standard scores arc identical to the scaled I'-scorcs found by 
Tcfeience to normal curve frequencies. 
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IQ of 103. Her profile, however, shows a range of performance from 
4 years-9 months in the space tests to 7 ycars-8 months in perceptual 
speed. In motor performance she is about one year accelerated over 
her age norm, while her mastery of verbal meaning and quantitative 
concepts is about normal for her age. 


standard 

Score 

70 - 

65 - 

60 - 

55 - 

50 - 

45 - 

40 - 

35 - 

30 - 


Percentile 



Fig. 76. Profile ol a fli.mi Schoc»1 Roy on the DilTcronna! Aptjludu 
(From Bcniictl, SeasliOi’c. and WcMiian, 4, p. LXj 


In Figure 76 be found a percentile pixdilc of a liigh school *boy 
on the DilTcrcntial Aptitude d'ests prepared by flic Psychological 
Corporation (4). I'his sludent, whose IQ is 1 15, had planned ti) enter 
an engineering schoed. His mediocre performance on the test cS space 
relations and his poor showing in moclianical comprehension raise 
doubts about his qiialihcaiions for engineering. On the basis of his 
other scores on the battery and his academic record, the student was 
advised to enroll in a general college course and to' postpone decisitm 
icgarding a field of specialization. An additional feature of interest 
in this type of profile chart is the spacing of percentile vaiucs in such 
a way that distances along the chart correspond to equal units of 
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ability in a normally distributed group. Thus, for example, the distance 
between the 80th and 90th percentiles is much greater than that 
between the 50th and 60th, as would be expected in a normal distri- 



Fig. 77, Profile of an Intellectually Superior Seven- Year-Old School 
Boy. (From DeVo.ss, 10, p. 351.) 


buiion. Converted standard score equivalents are also shown to the 
left of the percentile scale. 

Figures 77 and 78 arc standard-score profiles of two school children 
selected for their outstanding intellectual level.- The psychograph in 
Figure 77 is that of a boy with a Stanford-Binet IQ of 173. This child, 

^ PiUt of the gioup of giltcd chiidren studied by Tcrmaii and his associates at 
Stanford University (.cf. Ch. 17). 
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although above his age norm in all the tests, exhibits marked dis- 
crepancies among his scores. He is highest in arithmetic reasoning 
and computation. In the first of these he stands about 6% SD’s above 



Fig. 78. Profile of an Intellectually Superior len-Year-Old School Boy. 
(From DeVoss, 10, p. 3h().) 

his age norm, and in the second, nearly 5 SD’s. His performance is 
poorest on two of the verbal tests, language usage and sentence mean- 
ing. In Figure 78 is the psychograph of a school boy wath an IQ of 
155, who presents a very different picture. He is best in music-art 
information, second best in history-civics information, and poorest 
in arithmetic reasoning and computation. These examples illustrate 
the fact that intellectually superior children, although above their age 
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norms in most mental tests, may be much farther above average in 
some traits than in others. 

The use of the profile technique in the investigation of occupa- 
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Fig. 79. Occupational Profiles of Men Office Clerks and Garage 
Mechanics. (Fiom Dvorak, 11, p. 12.) 
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Fig. 80. Occupational Profiles of Women Office Clerks and Retail Sales- 
women. (F'rom Dvorak, 1 1, p. 10.) 


tional ability patterns is illustrated in Figures 79 and 80, based upon 
data collected by the University of Minnesota Employment Stabili- 
zation Research Institute (11). Adult men and women in several 
occupational groups were given a scries of tests covering educational 
ability (or “intelligence”), clerical aptitude, motor dexterity, and 
mechanical aptitude. Standard-score and percentile equivalents for 
each test were determined on a representative sampling of the em- 
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ployed population in three principal cities of Minnesota. The two 
contrasting profiles reproduced in Figure 79 were plotted from the 
average scores of a group of male office clerks and a group of garage 
mechanics.-^ It should be noted that these profiles have been plotted in 
terras of converted standard scores, with a mean at 5.0 points, as 
shown along the top of each graph. Along the base are the percentile 
values which correspond to these standard scores.^ Figure 80 shows 
the average profiles of women office clerks and retail saleswomen."^ 
Here, too, the contrast between the two groups is sharp. 

That these patterns represent typical and consistent differences is 
indicated by several additional observations made in the same survey. 
In one check, the accuracy with which individual profiles could be 
assigned to the proper occupational category solely on the basis of the 
test scores was determined. With sample profiles of the corresponding 
occupational groups as standards of reference, the profiles of 158 
employed w'omcn were examined. In this group, the profiles of 90 
women office clerks and 68 retail saleswomen were mixed in random 
order. Correct identification of the occupational category was made in 
92.4% of the cases. Comparisons of occupational sub-groups varying 
in degree of job efficiency, as well as comparisons of group^^employed 
by different companies, also indicated considerable consistency in 
the occupational profile. 

The practical application of occupational ability patterns in voca- 
tional counseling is illustrated by the General Aptitude Test Battery 
of the United States Employment Service (12). The battery consists 
of 15 tests mccisuring 10 selected aptitudes, including general intelli- 
gence, verbal, numerical, and spatial abilities, clerical perception, form 
perception, and several motor tests. Through the testing of many 
groups of employed persons, occupational ability patterns have been 
established for twenty fields of work, representing approximately 
2000 specific occupations. These ability patterns are described in 
terms of minimum scores in those aptitudes found to be significant for 

^The number of garage mechanics was 102, allhough the clerical aptitude average 
is based on only 101 cases. Among the office clerks, the number taking each test 
ranged from 66 to 114. 

^ This correspondence is based on the assumption that the abilities in question 
are normally distributed. 

^ For the women olfice clerks, N varied fiom 164 to 180; for the saleswomen, 
from 65 to 137. 

The proportion of correct identifications to be expected by chance would 
be 50%. 
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each type of occupation. All the aptitude measures are expressed as 
standard scores with a mean of 100 and an SD of 20. Each individual 
to be counseled is given the entire battery, and his scores are com- 
pared with the patterns required for the dilTcrcnt occupations. For 
example, accounting and related occupations were found to require a 



Fii». 81. A PcTSonalitv Piofilc St«>n(J<nil Scores o( d Young Man of 
Fighteen on the Minnesota Multiphasic Personality ln\ent(’)r\ (Unpubl. 
dcUa by coiiilcsy oi Di. William C Bici ) 


minimum score ot 130 in general inlelligeiiee and in numerical ability 
(piltern GN). Occupaiions g^'iupcd under heavy metal structural 
wotk, plumbing and lelated work, and wood structural work called 
lor an NSM pattern, with a mminnim score ol 8.5 in numerical and 
spatial aptitude and in manual dexu^rity. Similar paltcins have been 
w'orked out for twxmty lields ot work studied to dale with this battery. 

An increasing use ol the psychv>graphie appioach is also evident in 
the field oi pcrstmalily testing. In place ol a single index ot “emo- 
tional adjustment” or insi.ibihiy, a multi-dimensional description of 
the individual in terms oi several categories is substituted. An illus- 
tration of such i technique will be Ic aid in Figure 81, which shows 
the prohle oi a young man ot 18 on the Minnesota Multiphasic Per- 
onality Inventory (18). This profile indicates a pronounced tendency 
Uward depression (D), together with somewhat undue concern with 
licalth and physical complaints (Hs) and a deviation of the basic 
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interest pattern in the direction of femininity (Mf). All the scores 
in this profile arc expressed as standard scores, with the mean set at 
50 and one standard deviation equal to 10 points. Thus, for example, 
a score of 75 falls 2.5 SD's above tlic average of the standardization 
sample. 

Among the first tests to employ the profile method of scoring was 
the Allport-Vcrnon Study of Values (1). Mention should also be 
made of the various interest tests, such as those devised by Strong 
(45) and Kuder (25), whose approach is essctitially psychographic. 
An illustrative profile depicting the Kuder Preference Record scores 
made by a young college man is presented in Figure 82. Jn this test, 
the individual indicates his like or dislike for a variety of activities, 
and his choices are subsequently classified inlo nine areas of interest, 
as shown on the chart. The subject in the present illustraiiim shows 
a strong interest in persuasive activities (found to rate high among 
salesmen), as well as in literary and musical activities; his interests in 
the clerical, computational, and artistic areas are also quite high. He 
falls slightly below a\erage in social serxiee interest and far below 
in mechanical and seientifie interests. It should be noted that in the 
Kuder profile chart the percentiles have also been spaced in conformity 
with the assumption of a normal distribution, as was done in the 
Dillcrential Aptitude Tests discussed above, 

rXTRUME ASYMMl-TRirS OF TALENT 

A consideration of individual cases w^hich display conspicuous asym- 
metry of abilities helps us to visualize the extremes of variation which 
may occur within the individual. The study of special talents and 
defects may be regarded as one approach to the analysis of abilities 
and their mutual iiiterrelationships. Arc deficiencies along certain lines 
consistent with intellectual superiority? Do special talents in particu- 
lar fields ever accompany general inlellectiiai backwardness? The oc- 
currence of special talents or defects in a given area of behavior would 
suggest that ability in that area may develop and vary independently 
of ability in other areas. These case reports serve to point up and 
viviiy the findings of the more systematic statistical studies of trait 
relationships. 

Musical aptitude tests seem to have little or no relationship to supe- 
rior intelligence. This is illustrated by data obtained by L. S. Holling- 
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worth (22) in a study of 49 intellectually gifted children. All the sub- 
jects were enrolled in special classes conducted for children with IQ’s 
of 135 or higher. The median Stanford-Binct 10 of this group was 
153, and the range extended from 135 to 190. The children were 
tested with the Seashore Measures of Musical Talent. 1 heir scores 
were evaluated in terms of fifth grade norms, since the chronological 
ages of the gifted group corresponded closely to those of the iilth 
grade school children in the Seashore standardization sample. Below 
will be found the a\crage percentile score obtained by the gifted group 
on each of the Seashore tests. 


PilLh 

47 

Intensity 

50 

Time 

58 

C onson nice 

48 

Ton.il mcmoiy 

52 


A percentile score of 50, it will be recalled corresponds to the middle- 
most score ot the sl<uidardi/ati(»n group and thus repiesents a ‘'nor- 
mal,” or aveiage, performance, I he fact that all the average percentile 
scores ol the supeiior group w^cre so close to 50 indicates tluft musical 
aptitude is distributed among these children in \ci> much the same 
fashion as in any group of the Stime age chosen at random. Although 
m intelligence test pcrfoimaii^e these sub)ccts were all within the 
upper ol the general population, their individual percentile scores 
on the music tests r uigcd irom zero to 98. 

Individual cases of intellectually superior children with a pro- 
nounced dclicicncy in music can readily be found. One ] 0-year-old 
school boy, lor example, had an IQ of 15! but obtained scores which 
ranged from the zero to the 30ih percentile on the Seashore music 
tests (20, p. ]7^1. His school work in such subjects as reading, arith- 
metic, and elementary science was excellent. But his music teacher 
regarded him as a complete failure and advocated that he jcpcat the 
grade! 

Case studies of ai ithnictical prodigies and “lightning calculators'’ 
indicate that a high level of numerical aptitude can likewise occur in 
individuals of average or inferior intelligence. Many such cases, from 
the early Greeks to the end of the last century, were co'lccted and 
described by Scripture (40) and later by Mitchell (30). More re- 
cently, much of this material was brought together in a collection of 
papers prepared and edited by Bryan, Lindley, and Harter (9). In 
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regard to their achievements along other lines, or their practical ability 
to succeed in everyday life, mathematical prodigies run the gamut 
from genius and eminence of the highest order to mental dullness. 
A few would no doubt be classified as “borderline” or lower on current 
intelligence tests. At the other extreme are such men as Gauss and 
Ampere, whose exceptional talents covered a wide range, and who 
made distinguished contributions in mathematics and allied fields. 
These men were “lightning calculators,” but also possessed very 
superior ability along nianv other lines. For the present purpose, 
however, we are concerned with cases of asymmetrical development 
in which prodigious arithmetic powers are coupled with mediocrity 
or deficiency in other respects. 

Henri Mondeaux (cf. 40), the untutored son of a poor woodcutter, 
is a famous example of remarkable arithmetic ability in an otherwise 
dull person. In his childhood he received no instruction, but was sent 
to tend sheep at the age of 12. While engaged in tliis occupation, he 
amused himself by counting and arranging pebbles; by this means he 
learned to carry out arithmetic operations. He worked out for his own 
use many special devices and aids to compulation. After long exercises 
at these calculations, he offered to tell people he met the number of 
seconds in their ages. At this time, a schoolmaster became interested 
in him and offered to instruct him. Unfortunately the boy had a very 
poor memory for names and addresses and he spent nearly a month 
searching the city before he was able to locate his benefactor. Mon- 
dcaux was subscquentl} cxliibited by his teacher at several colleges 
and universities and in 1840 was presented before the Academy of 
Sciences at Paris. His was not merely a talent for routine calculation, 
but he demonstrated his ability lO solve, by ingenious devices of his 
own making, complex problems such as the following: 

There is a fountain containing an unknown quantity of water; around 
it stand people with vessels capable of containing a certain unknown quan- 
tity. They draw at the following rate* the first takes 100 quarts and * la 
ot the remainder; the second takes 200 quarts and ^ i.j of the remainder; 
the third, 300 quarts and of the remainder, and so on until the foun- 
tain is emptied. How many quarts were there? 

Mondeaux gave the correct answer to this problem in a few seconds 
and then explained the method whereby he had arrived at the solution. 

A similar case is that of Tom Fuller, an African slave brought to 
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America in 1724 at the age of 14. He could neither read nor write 
and received no formal instruction. As in the case ot Mondcaux, his 
arithmetic was entirely self-taught. It is reported of him that when 
asked how many seconds a man had lived who is 70 years-17 days-12 
hours old, he gave the answer, after Wi minutes, as 2,210,500,800 
seconds. One of his questioners had meantime been computing with 
paper and pencil and reported that he had arrived at a different num- 
ber, which he proceeded to read off. At this, Fuller immediately 
pointed out that his questioner had forgotten to allow for leap years! 

A few cases of '‘lightning calculators"' have been directly observed 
and investigated by psychologists. ‘ 1 he information thus obtained, as 
well as the careful analysis of available reports on arithmetic prodigies, 
has brought to light certain chaiactenstics ot these individuals which 
may account tor their talent. In most ca^es, the individual has worked 
out a number ot short cuts and special devices which enable him to 
compute far more eflicicntly than is ordinarily possible. Secondly, 
such individuals have usually mcmori/cd many more number combi- 
nations, such as squares, ciilxs. roots, and products, than are at the 
disposal of the average man Arithmetical prodigies invariabl} mani- 
fest a very keen interest and fascination tor numbers. As result, 
they devote much time to analysis of computation methods and to 
drill which would otherwise prove highly monotonous. Many also 
seem to have a large perception span which enables thc’n to grasp a 
long series of numbers simultaneously, as well as vivid imagery, 
making possible "mental eomputatiorr’ v\ithoul the aid ot paper and 
pencil Moreover, such piodigies often build up a wsaltli ot associa- 
tions to numbers, and are thus able to use numbers in their ihmking 
in much the same w i\ lliat the average person use > verbal symbols. 

The most spectacular examples of special talent are the so-called 
idiots senants, This teim, winch literally means "wise idiots,” is some- 
what misleading, since tlie usual idiot s<iva’it is neither particularly 
wise nor an idiot. He is not stiff iciently deficient to be classified as an 
idiot, but is frequently found at the moron or borderline level. And 
he is "wise” onl> in a veiy limited field. In the practical management 
of his own life he is ordinarily a complete failure. 

As might be expected, idiots savants are extremely rare. Because of 
their remarkable accomplishments, however, they attract considerable 
attention, and a number of fairlv complete descriptive accounts are 
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now available. Most of the earlier cases were summarized by Tredgold 
(50). More recently, Rife and Snyder (37) addressed an inquiry 
to 55 American institutions for the feebleminded, through which they 
were able to locate 33 cases of idiots savants. Of these, 8 showed a 
special talent in music, 8 in mathematics, 7 in drawing, and 10 in 
miscellaneous areas including mechanics, meme')ry, and motor coor- 
dination. A few of these cases manifested skills which were narrowly 
limited and of dubious psychological significance. On the other hand, 
a number gave evidence of well-rounded achievement in a fairly broad 
area. 

Several cases of special talent in pictorial art have been found 
among the feebleminded. Such individuals are able to execute exccl- 
jent reproductions of well-known paintings. Occasionally this talent 
passes beyond mere copying and suggests real creative genius. Such 
a case is that of Gottfried Mind (50, Ch. XV), diagnosed as a cretin 
imbecile. His mental deficiency, manifested from an early age, was 
such that he was unable to learn to read or write. His movements were 
awkward, his hands large and rough, and his general appearance that 
of the traditional mental defective. Since he showed considerable talent 
for drawing, he was given some instruction in this field. His subse- 
quent success in pictorial art was phenomenal. Because of his excellent 
drawings of cats, he came to be known as “The Cat's Raphael.” In 
addition, he produced drawings and water-color sketches of deer, 
rabbits, bears, and groups of children, which were remarkable for 
their life-like quality and masterly execution. His fame spread 
throughout Europe and one ( f his pictures of a cat and kittens was 
purchased by King George IV of England. 

An equally remarkable case is that of .1. H. Pullen, who has been 
called “The Genius of Earlswood Asylum” (50, Ch. XV). This indi- 
vidual had extraordinary mechanical ingenuity coupled with talent in 
drawing and carving. In other respects he was very deficient. He did 
not talk until the age of 7, and for a long time uttered only the word 
“muvvcr.” Probably because of a severe hearing deficiency, his speech 
never came up to normal. He was taught by his family to write and to 
spell the names of simple objects, and this constituted the extent of 
his schooling. From an early age, he spent much of his time in drawing 
or in carving ships out of pieces of firewood, occupations in which he 
showed considerable proficiency. At the age of 15, he was admitted 
to Earlswood Asylum, where he was put to work in the carpenter’s 
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shop and soon became an expert craftsman. During his sixty-six years 
at the asylum, he produced an impressive arra> of beautiful and highly 
ingenious objects, including crayon drawings, carvings in ivory and 
wood, excellent models of ships, and various mechanical devices. Oc- 
casionally he even designed his own instruments to help him in his 
work. 

One of Pullen’s constructions was a representation of a gigantic 
human lorm, thirteen feet high. This full-fashioned "lobot” could be 
made to execute a variety of movements, such as raising the arms, 
rotating the head, protruding the tongue, ai'd opening and shutting 
the mouth or eyes. Another remarkable construction was a model of 
a ship, beautifully executed in the minutest detail. This model re- 
quired over three years for its completion and attracted universal 
admiration when exhibited. Pullen’s ^\ork revealed artistic imagination 
as well as mechanical ingenuity, skill in planning, and painstaking 
execution. Being cut off from many ordinary sources of stimulation 
by deafness, it is probable that he concentrated all his efforts from 
childhood upon the development of this one remarkable talent. In 
regard to general personality development, he is described as childish 
and immature, emotionally unstable, and lacking in commoivsense. 

Special talent in music has also been observed among the intellec- 
tually deficient. A case (50, Ch. XV) of exceptional musical ability 
combined with serious defect in other respects is that of a woman in 
the Salpetriere, a famous French institution for the feebleminded and 
the insane. This patient was an imbecile, blind from birth, a cripple, 
and affected with rickets. She was, however, able to sing without error 
any selection which she had heard. It became customary for her 
lellow-inmates to come to her so that she might correct their mistakes 
in singing. She attracted wide attention, and it is repoited that the 
composers Liszt and Meyerbeer visited her “singing class” to bring 
their encouragement and consolation. 

More rcccntl), another instance of musical talent combined with 
intellectual deficiency was investigated by the use of standardized 
intelligence tests (cf. 29), This was the case of a boy admitted to a 
feebleminded institution at the age of 14. He came from an intellec- 
tually superior family which included many musically gifted individ- 
uals among its members. As a child, the subject was intellectually 
normal and manifested his musical talent from an early age. When 
three years old, he had pneumonia and meningit, , and since that time 
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he underwent steady mental deterioration. Upon admission, his IQ 
was 62; at the last testing, it had dropped to 46. He was then over 
20 years of age and had a mental age of 7 years-5 months. His mem- 
ory was unimpaired, however, and he retained his excellent musical 
ability. Although never known to compose a piece, he could play 
difficult music by ear and was also able to read difficult musical com- 
positions at sight. 

The feats of memory performed by some feebleminded individuals 
have often attracted notice. Trcdgold (50, Ch. XV), for example, 
describes a 65-year-old mental defective in Earlsw^ood Asylum with 
a remarkable menu^ry for historical facts. He could repeat the dates 
of birth and death and the essential facts in the life of any prominent 
character in history. This knowledge was acquired largely by rote, 
through poring over all available books on biography and history. It 
was not, however, a matter of sheer meaningless repetition, as was 
shown by the subject’s responses when questioned on the material. 
Another patient at the same institution showed an excellent memory 
for dates and occurrences which had come within his owm experience. 
He proved a useful source of reference on local happenings in the 
institution. 

Arithmetical prodigies have also been found among the ranks of 
the feebleminded. Usually the skill manifested is confined to the 
mechanics of computation. Thus the subject may perform long and 
complicated calculations within a very short time and without the aid 
of paper and pencil. A favorite feat is to determine the number of 
minutes a person has lived, from a knov/lcdge of his age or date of 
birth. Multiplication of three-place numbers, naming square roots and 
cube roots of four-place numbers, and similar difficult operations 
have also been executed within a few seconds. In some cases, this 
numerical aj)titudc goes beyond routine computation, as is indicated 
by the individiiars ability to solve mathematical problems expressed 
in fairly elaborate and confusing terms. An example of fairly complex 
numerical aptitude, appearing early in lilc, is provided by the case of 
a 27-year-old man with a mental age of 3. described by Rife and 
Snyder (37, pp. 553-554). They wotc; 

As a small child he would scribble figures on the bathroom tiles or 
other places whenever he could get hold of a pencil. He never learned 
to talk, and even now cannot perform such simple requests as pointing to 
his eyes or ears. In school he could do absolutely nothing, so vvas sent 
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home, and at sixteen was admitted to the Institution. His hearing is 
normal. . . . Although he is incapable of carrying on a conversation, or 
ol understanding spoken requests, one may make one’s desires along 
mathematical lines known with a pencil. When a pencil and paper were 
taken, and the figures 2, 4, and 8 written in a vertical column, the patient 
immediately continued the series 16, 32, 64, etc. When the scries 2, 4, 16 
was started, he immediately continued this one, the sixth number being 
4,294,967,296. Then 9 — 3 was written, in the attempt to indicate square 
root. Under this, several numbers such as 625, 729 , and 900 were written. 
The square root of each was immediately and correctly written. Any prob- 
lem ol multiplication of several digits by seveii^il digits was done immedi- 
ately, only the answer being written. 

THH IVlhASURl'MLNT OF TR\IT V\R]Anil FiY 

The term trait variability vvas fir>.t proposed by Hull (23) to designate 
variability from trait to trait within the individual, in contrast to indi- 
vidtuil \aruibilit\\ which rclcrs to the diTerences among individuals 
in a single trait. Any td' the methods commonly employed to measure 
individual variability can be applied to the measurement of trait 
variability, provided that the scores on different tests are expressed 
in the same units. In view of the extreme asymmetries of lalcnl occa- 
sionally observed, the question arises regarding the CMcnt of trait 
variah/iiity found among pec^ple in general. 

In an early study by Hull (23). the extent of trait variability was 
gauged by comparing it with the amount of individual variability 
within single tests, 1 lie data consisted of the scores of 107 high 
sciiool frcvhmcn »m 35 tests, inciuding several sub-tests from intelli- 
gence scales, as well as tests of motor functions, attention, percep- 
tion, and personality characteristics. All scores were transmuted into 
standard sc(^rcs and the SI3 of each person’s 35 scores was then com- 
puted. There wxmv thus ('obtained as many SD’s as there \^erc subjects, 
viz., 107. The average of these 107 vSD’s was 6.33. Hull compared 
this figure with the SO of 7 which, in the scale of units employed, rep- 
resented the variability fnnn person to person (i.e., individual varia- 
bility) in amy one of tlic tests. After allowing for possible chante 
errors of sampling and measurement, Hull estimated that the trait 
variability was about 80% as great as the individual variability. 

It should be noted that such an estimate of the extent of *.rait varia- 



Variability within the Individual 


All 


biiity is limited by the conditions of the particular investigation. The 
estimate will be affected by the number of tests employed, the range 
of functions tested, and the heterogeneity of the group (16). The 
more homogeneous the group, the smaller will be the SD’s repre- 
senting individual variability in any of the tests. It will be recalled 
that these SD’s provide the units in terms of which the individuaTs 
score in each test must be expressed before his trait variabiliu can 
be measured. It follows that the more heterogeneous the group, the 
smaller will the trait variability appear, since it will be expressed in 
terms of a larger unit. Thus such estimates should not be generalized 
beyond the type of population on which they were obtained. 

That the 80% estimate is not too far irom what would be found 
in other typical groups is suggested by the findings of a more recent 
study conducted in France (34). The average trait variability was 
measured in cacli of four different groups, the number in' tests used 
in each group being shown below: 
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From the results obtained witli these groups, it was concluded that 
trait variability tended to be slightly over 75%; as great as the indi- 
vidual variability of the group. 

The distribution of ca h inJividuafs scores on the different tests 
seems to follow Ibe general form ol the nornial eurve. Most of the 
individtiars scores cluster about his own average, wath only a few' 
scores deviating widely in cither direction (23). The extent of trail 
variability differs widely Irom person to person, some individuals 
being considerably more uniform in their abilities than others. In the 
Hull (23) study, for example, the individual SD's for trait variability 
ranged from 4.30 to 9.09. 1 hese indices ot trait variability are them- 
selves normally distributed, and show no evidence for the presence 
of distinct "types” in reference to degree of trait heterogeneity (34). 

A question of considerable theoretical as well as practical interest 
is whether any relationship exists between ability level and extent of 
trait variability in different individuals. Some investigators have found 
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no significant correlation between the two (7, 23). In these studies 
there appeared to be no relationship between the individual’s trait 
variability and his standing either in specific tests or in the average of 
all the tests The groups studied, however, consisted of either high 
school students or college freshmen and were therefore relatively 
homogeneous in ability It is possible that the correlation might be 
higher if a wider ability range were coii'Nidered Moreover, the rela- 
tionship between ability level and trait variability ma) be curvilinear 
in which case the use of the usual Pearson correlation coefficient 
would underestimate the degree of relationship 

There is some evidence m the data collected by other investigators 
which suggests a negative correlation between ability le\el and trait 
variability (34, 49) In one study (49), the duller children showed 
more unevenness ot abilities than did the avciagc, and the average 
were more uneven than the bright When the trait \ariabihtv of the 
average group was taken as 100%, the trait variability of the dull 
group wa^ lound to lx IIO^^ nnd that ol the bright group 92% 
Although the trend tow<ird an invene lelationship between intelhiienec 
and scatter was found consistently with all the inklligcnec k 4s em- 
plo>ed in this study the actual correlation was low Similar^ in the 
French stud) on four ch^crent groups cited above (34), a eonsistciit 
tendency was observed for trait varialrlity to be gieater among the 
poorer subjects Below aie the average trait variabilities of sub|eels in 
each of the lour gioups the subjects bemg ela'^sil\d aeeoiding to 
then eornposite performance level on all the tests 

total It Ion int( Ih hi \(tatnncl VnisS ht / 

tn ill 1 1 rs Candditi\ Sclioo^(jin\ ippunlicc i liddun 

SiipenoT [Fppu '’S 2 SI 143 3 IS 123 

\\a Mitldk so 3 66 3 !9 164 

Inkiioi Lnuci 4 X) 3 83 3 42 1 67 

^ Uy <ii]j\iim< »r cliOonship is rnc ml tint tli l\p of Tclilnrishp is not tin orm 
thioiithoui iht I nj-L Iti ex unplt silanes m ly jiiLitis^. is mo int oi ccIul iti m 
incic is;.s u\ to I point 'iml thui dtcinse is llio hi}.lKSt tduc itioii il levels irt 
rcithcd Oi iht ude of nsi isc m iv viiy m d Ikitiil p irts of tin. i me In Uovmis 
iidv lint viriinilit> (/) ciirviiintn nuiel uion vv is LOnip itcd Sul st/ll no cvi 
(It )tc ol sunilit. n t ilIUrjisIup w is loind In tins bludv hovvc\Lj the subjects wue 
quite homcvtiHOu ill heiiij. co le4< tieslimen 

The unit in which ilicsc v iliics ue upnit d is ViSD Since most of these tnit 
varnbihtie ue dost to 3 it wi'l he noted Ihu they aie ippioxmialcly 4 . is laigt 
as the SD of the distnhuUon c^ individuil clifttienccs 1 he itl uivtiy low li ut \ uu 
bilitics found imong llie school diildien ire piobibi> the tcsull ot the ‘'luill numb r 
and icsiiieted \ uicly c4 tests employed with this jioup 
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Relevant data are also provided in the study conducted by DeVoss 
(10) to determine whether gifted children arc more specialized in 
thcii abilities than normal children, i.c., whether they show wider 
trait variability. A group of 100 subjects were selected on the basis 
of mental age from a larger group of “gifted children” studied by 
Tcrnian and his associates (cf. Ch. 17). The menial ages of DeVoss’ 
group ranged between 14 and 15-5, with an average of 14-8: tlic 
chronological ages ranged from 8-6 to 1 1-1. The average IQ was 
149.4, and the range from 136 to 180. In school grade, the children 
were scattered from the third to the eighth grade, inclusive. These sub- 
jects were compared with a control group of 96 unselecled eighth 
grade school children of approximately the same mental ages as the 
superior group. Both groups were given the Stanford Achievement 
Test, consisting of seven sub-lests on different school subjects, as well 
as information tests in special fields. All scores were reduced to stand- 
ard measures. 

Examination of the inter-test variations within each subject’s scores 
revealed many differences large enough to be significant. By means of 
a specially devised statistical formula,’’^ it was possible to estimate how 
large a trait difference might be obtained simply through the operation 
of chance factors, such as inadequacies in the tests employed. Upon 
the application of this formula, it was discovered that a large percent- 
age of the trait differences fell beyond the chance limits and must 
therefore represent a true discrepancy in the individual's standing in 
the traits compared. Jn Table 27 are given the percentage of trait 
differences, in both the gi'ted and control groups, which fell outside 
of the distribution of differences expected by chance. The percentages 
in the gifted group are given above the diagonal, those in the control 
group below the diagonal. Tlie tc.‘‘ts which are being compared are 
indicated in the top row and first column to the left. Thus, in the gifted 
group, 24% of the differences between scores on arithmetic reasoning 
and computation fell beyond the chance limits; the corresponding per- 
centage for the control group is 34%, and so on. 


A formula for the computation of the PH of the difference between an indi' 
vidual’s standard scores on any two tests: 

pba.ff .6745V2~'T;7=T:7r, 

in which rn and ijn are the reliability coefficients of the two tests. 
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TABLE 27 Percentage of Trait Differences among 100 Gifted and 96 
Control Children Which Fall Outside of the Chance Limits * 


(From lV\"os 


Tests to [ic § 3 S? 

Compared -5 S. S S 


Arithmetic 


computation 

Arithmetic 


24 

32 

reasoning 

Word 

34 


30 

meaning 

Sentence 

39 

30 


meaning 

Paragraph 

40 

26 

13 

meaning 

Language 

35 

26 

14 

usage 

39 

33 

24 

Spelling 

Science 

31 

36 

28 

infoimalion 

35 

20 

24 


10, p. 325) 


Sentence 

meaning 

cC 5 

2 

& "i- 

Spelling 

1 Science 

1 information 

31 

34 

37 

26 

33 

31 

31 

38 

29 

29 

13 

24 

25 

26 

23 


26 

24 

28 

29 

17 


33 

27 

34 

23 

17 


30 

32 

33 

25 

30 


30 

25 

21 

30 

31 ^ 



Gifleil ^roLip abo\e tijf diagonal; upper right-hand block. Contiol group below 
ihc diagonal, lower ieti hand bloek. 


It will be noted that in every test pair compared there are found 
dilTcrences over and above those expected by chance. This is true of 
both gifted and control groups. Tiie percentages of such dilTcrences 
arc also closely similar in the two groups, test by test. In the gifted 
group, these percentages vary from 13 to 38, in the control group 
from 13 to 40. The average percentages are 28.89 and 27.82 for 
gifted and control groups, respectively. Out of the 28 inter-test com- 
parisons given in Table 27, the gifted group has the larger percentage 
of excess dilTcrences 13 times, tlie conlroi group has the larger per- 
centage 12 times, and the two have identical percentages in 3 cases. 
Thus there seems to be no appreciable or cc^nsistent dilTcrence in trait 
variability betw^ecn intellectually normal and superior children. 

In conclusion, the available data on trait variability offer no sup- 
port to the popular notion that intellectually gifted children show a 
higher degree of specialization than do the normal. In fact, if any 
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difference exists in this respect, it is the duller individual who appears 
to be more specialized than the normal, but additional research is 
needed to establish such a relationship. 

Two additional variables whose relationship to trait variability has 
been studied are practice and age. In a re-ana lysis of data collected by 
several investigators, Preston (36) has shown that trait variability 
tends to decrease with practice and to increase with age. The effect 
of equal practice is to make the subject more uniform in the various 
practiced tasks. Age has the opposite effect upon trait variability, the 
older individual showing more scatter or specialization of ability. It 
cannot be assumed, of course, that age per se accounts for such 
changes in trait variability. The groups compared in these investiga- 
tions also differed in educational level and probably in other respects. 
It is entirely possible, for example, that education may increase trait 
variability, even though practice tends to decrease such variability, 
nducation obviously docs not consist of “equal practice” in all intel- 
lectual functions. Not only does the amount of practice vary in 
different areas, but motivational changes and other complicating 
influences are probably introduced in different ways for different 
individuals. 'The effects of education on trait variability may thus be 
quite unlike those obtained in simple practice experiments.'^ 

The relationship between personality characteristics and unevenness 
of ability also presents a fruitful area for research, but the available 
data on this question arc still highly tentative.^- Interest in this rela- 
tionship has been stimulated in recent years by the development of 
the Wcchsler-Bellcvue b^telligence Seale. Jt has been suggested that 
tlic pattern of the individuaFs scores on the different sub-tests of this 
scale may serve as an index of v.irioiis emotional disorders. This 
application of the concept of tiait variability has attracted wide atten- 
tion among clinicians, but it would be premature to evaluate it in its 
present stage. 

INTnU/ORRET.AlIONS OF TEST SCORES 

The examination of extreme examples of asymmetrical development, 
as well as the measurement of trait variability within individuals in 

^ ^ The relationship of age and education to the specialization of ability w II be 
consideied moic fully in the following chapter. 

'“C'l., eg., Bown (7), Freeman (14). 
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general, suggests that superior talents in one line may be associated 
with inferior abilities in other respects. It is not to be concluded from 
this, however, that compensation is the rule. Superior standing in one 
trait does not imply inferiority in another. We have cited only exam- 
ples in which individuals with a high standing in a certain trait A 
make a poor showing in a second trait B. We could with equal facility 
find cases in which the individual is superior m A as well as B, or 
superior in A and average in B. This, in fact, is what we mean by a 
low or zero correlation. If various abilities are specific and mutually 
independent, so that an individuaFs standiiig in one tells us nothing 
about his lelative standing in another, we should expect the correlati m 
between such abilities to be zero or very low. 

Correlation thus offeis another approach to the analysis of tiait 
variability. It should be noted that these arc literally altcrnati c ap- 
proaches or ways of expressing the same facts. Ihus the asymmetries 
of ability illustrated in an earlier section are only extreme cases of 
trait variability. Similarly, it can be shown by simple algebra that 
measures of trait variability depend upon the intercorrelations among 
the traits under consideration, and that the one type ol measuic can 
be derived from th(^ other (cf. 16, 35 ^ The aveiage tiait variability 
of a group of individuals in a given series ol tests can be lound by the 
following formula: 

V - 1 - ' - 

n n 

in w'hich, 

V IS the average van jnee ' > within the individual, expressed in terms 
ol the variance .imong individuals m a sint,le tea, 

n is the numhei ol tests, and 

IS the sum ol all the intercorrelations among tlicsc tests, each corre- 
lation being cnteied tv\ice. lor example, the coiielation betw'cen 
tests 1 and 2 v^ould appear m the complete con elation t.iblc tiS 
as r^_, and as i j^. 

By means of this formula, it can readily be shown that il all the tests 
were perfectly correlated \ iih each other, trait vaiiabiiity would be 
zero (35). On the other hand, if all the intercorrelations among the 
tests were zero, the average trait variability would be nearly as high 
as the individual variability, and would approach the latter as the 

’^Thc variance is the squaic of the standaid deviation. 
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number of tests increases. An examination of correlation coefficients 
can thus provide the same type of information \vhich is obtained by 
the measurement of trait variability. 

Profile asymmetries can likewise be investigated by the correlation 
technique. Between what areas of ability are such asymmetries likely 
to occur? Do certain functions tend to vary together wilhin the indi- 
vidual, so that a deviation in one will be accompanied by a similar 
deviation in the other? I'hesc arc the types of questions that are an- 
swered by correlation coefficients. Certain functions have long been 
recognized as “special aptitudes,” a designation which carries with u 
a tacit presupposition of low' or zero correlation with other functions. 
Among the most familiar arc musical, artistic, and mechanical apti- 
tudes. Jt will be recalled that these are some of the areas in which 
marked asymmetries of talent have been reported. We may now 
inquire what tlie c^Trclational approach has to oiler regarding these 
aptitudes. 

Tests of musical aptitude have consistently shown low — and usu- 
ally negligible — correlations with measures of “general inlelligence.” 
In a group of 74 college students, a corrclalion of “.17 was found 
between intelligence test scores and a test of musical appreciation 
(19). In the same group, a dilTcrent form of the music test yielded 
a correlation of -.15 with the intelligence test. The fact that these 
correlations are negative mig!il suggest a slight tendency for the more 
“intelligent” individuals within such a group to be poorer in music 
appreciation, but the correlations arc loo low to be significant. In 
another investigation witl, 230 college students, intelligence test scores 
were correlated with each of the Seashore tests of musical sensitivity. 
The correlations were all positive but very low, only one being sig- 
nificantly higher than zero ( 13 ). Other studies with the Seashore tests 
have yielded equally low correlations with intelligence tests at all age 
and grade levels (38; 31, pp. 335-340). 

Aptitude in pictorial art shows a similar independence of general 
intelligence. Correlations ranging from .14 to H-.28 were found 
between the Meier Art Judgment Test and intelligence test scores in 
various groups of high school and college students (28 ) . None of these 
correlations was statistically significant. Among elementary school 
children, an equally low relationship has been found between intelli- 
gence test scores and test performance in either representative (6) or 
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creative drawing (47). A few of these correlations arc statistically 
signilicant, indicating more than a chance relationship, but all are low 
enough to permit marked asyjnmctry in individual cases. 

Mechanical aptitude also appears to be a special ability. The Sten- 
quist Assembly Tests, involving the construction of common mechani- 
cal objects such as lock, bicycle bell, and mouse trap, correlated .23 
with intelligence test scores in a group of 267 seventh and eighth 
grade boys (43). Although significantly higher than zero, this correla- 
tion indicates only a slight degree of relationship. Jn the standardization 
of the Minnesota Mechanical Ability Tests (32), a correlation of .13 
was found in a group of 100 junior high school boys between JO and 
a mechanical aptitude battery consisting entirely of apparatus or 
manipulation tests. Similarly, an investigation on 225 college men 
show^ed a correlation of only .07 between a vocabulary test and the 
Minnesota Paper Form Board 4est (2). I'he latter is a paper-and- 
pencil test of the ability to visualize spatial relations. Vi)cabulary 
tests, which measure the subject's understanding of word meanings, 
have been found to correlate so highly wdth the majority of common 
inteliigcnce tests as to be practically interchangeable with them. From 
these examples, it is apparent that in large groups of subji.^ts of dif- 
ferent age and academic level, otdy a very Ic'W positive correlation 
exists betw'een spatial or mechanical ability and the verbal type of 
intelligence test. Wi.en the mechanical ])roblcnis arc presented verb- 
ally, the correlations with intcfligencc tests are usually higlier because 
of the common inlluence of the comprehension of verbal directions, 
knowledge of words, and general facility with verbal material. 

It should be noted that superior ac/iievenu’nt in the fields of art, 
music, or mechanical work requires other abilities in addition to the 
spec'al talents which have been discussed. Many of the abilities 
measured by intelligence tests arc needed for tlie type of training 
wliich makes higher levels of accomplishment in these fields possible. 
The professional application of these talents, moreover, often demands 
other skills in the comprehension of complex verbal or numerical 
concepts, effective social communication, and the like. It is not sur- 
prising, therefore, that when indices of actual achievement are em- 
ployed, successful performance in music, art, or mechanical work 
shows a closer relationship with intelligence test scores thari is found 
between tests of these special aptitudes and intelligence tests. 

Among school children, fairly high correlations have been reported 
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between intelligence test scores and such measures of musical achieve- 
ment as grades in music classes or ratings by music teachers (31, 
pp. 335-340). That these correlations arc higher than those found 
with musical aptitude tests may result in part from the contribution of 
other abilities besides musical aptitude in determining such achieve- 
ment and in part from a probable halo effect in the grades and rat- 
ings. For SLiccesslul accomplishment in almost any speciali/ed area, a 
certain minimum level of ''general intelligence'’ is an essential pre- 
requisite. In surveys of artistically talented high school students as 
well as recognized adult artists, the average inlelhgence test perform- 
ance was found to be clearly superior to that of comparable, artisti- 
cally unsclccted samplings (4S). Similarly, a group of artistically 
superior children studied at the University of Iowa had lO’s ranging 
from 111 to 166 (27). Within such groups, however, there is no 
evidence that the degree of artistic merit oi recognition is correlated 
with intelligence test score. Similar data for the mechanical field arc 
provided by the average intelligence test scores obtained by persons 
engaged in different vocations. Inventors, engineers, architects, and 
other pel sons occupied with creative mechanical pursuits receive con- 
siderably higher iiUelligencc test scores than the average. 

All this simpiv suggc'^ts, however, that more than a single special- 
ized talent is required lor the higher levels of achievement in music, 
art, or mechanics. The fact remains tiiat a highly developed ability in 
any of these areas may coexist with low ability along other lines. In 
such cases, the specialised ability simply diK's not have as much 
‘'social market value,” either academically or vocal ioriall}', as it would 
if it were accompanied by other abilities. 

Of the principal aptitudes suggested by the case reports of extreme 
asymmetries of inielleelual development, only ii/iinerical ability re- 
mains to be examinech Despite tlie indisputable presence of "mathe- 
matical prodigies'’ who are deficient in other respects, numerical 
aidlity has not usually been classed with special aptitudes. Arithmetic 
tests are also frequently included m intelligence scales. Recent corre- 
lational analysis has demonstrated, however, dial the relationships 
between verbal and numerical tests me much lower than those within 
cither group of abilities. In many investigations, the correlations be- 
tween verbal and numerical tests were no higher than those between 
verbal icA^ and the various special aptitudes discussed above. 

In one study (39), 210 college men were given five verbal and 



486 Differential Psychology 


four numerical tests. The average correlation between all possible 
pairs of verbal tests was .49; the corresponding average correlation 
for the numerical tests was .34. When verbal and numerical tests were 
paired off, the average of the correlations thus obtained was only .14. 
Even this rather low correlation probably resulted in part from the 
fact that in at least one of the numerical tests the problems were 
expressed in verbal terms, and in all the tests the directions were 
given verbally. Among 225 college men tested in another investiga- 
tion, a correlation of —.01 was iound between arithmetic reasoning 
and vocabulary (2). In still another study (3), 140 college women 
were tested with two verbal tests (vocabulary and analogies) and two 
numerical tests (arithmetic reasoning and number scries completion). 
The correlation between the two verbal tests was .65 and that between 
the two numerical tests .58. When verbal and numerical tests were 
paired oft’, however, the average correlation was only .16. 

These various findings suggest that verbal and numerical tests ap- 
pear to be mcasurimz two “special aptitudes'’ in the same sense as the 
other tests discussed above. 1o be sure, these two abilities are less 
specialized among younger siibjects and individuals of lower educa- 
tional level, a finding whose implications will be considered in the 
following chapter. Hut among all individuals, they are '•fiiHicicntly 
differentiated to hd\e led to the now common practice of reporting 
separate scores tor “limzuistic’' and “numerical,” or “quantitative,” 
intelligence on most intelligciijcc tests. 

WHAl 1)0 “l\ri I 1 !C.l NC 1 II sis" Ml \Sl \U ? 

After this brief ()ver\iew of the most commonly ob'-erved “special 
aptitudes," wc may well ask what is lelt of “general intelligence.” 
Perhaps the content of “intelligence tests'’ nuiy jnoMde a clue to the 
connotatit)ns ol the term “inteiligcncc" as a working concept. It will 
be recalled that the original aim (4 intelligence tests was to sample 
a large number ol different abilities in carder to ariive at an estimate 
of the subject's general level of performance. In so far as the individ- 
ual’s standing in specific functuins differs, such a gene rad estimate is 
unsatisfactor} . It is apparent, however, that current intelligence tests 
do not even furnish an adequate estimate of the average ability of the 
individual, since they are overweighted with certain functions and 
omit others. Thus in the non-language and performance tests of 
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intelligence, spatial aptitude plays the dominant role. Most paper-and- 
pencil tests, on the other hand, measure chiefly verbal ability and, to 
a slighter extent, numerical ability. Since the latter type of test is by 
far the most frequently employed, the term “intelligence” has come to 
be used almost synonymously with verbal ability. Mental age on the 
wStanford-Binct, for example, has been found to correlate on the 
average about .81 with performance on the vocabulary test of the 
scale (46, p. 302). Within single age groups, this correlation ranges 
from .65 to .91. 

From another angle, most intelligence tests may be regarded as 
measures of scholastic aptitude, or ability to succeed in our schools. 
This is illustrated particularly well by the procedure commonly fol- 
lowed in validating intelligence tests. The term “validity,” it will be 
recalled, denotes the degree to which a test actually measures what it 
purports to measure. In the case of most intelligence tests, validity 
has been checked against school success as a criterion. Scores on the 
lest are correlated with school grades or teachers’ estimates of ability, 
and the higher these correlations the more valid the test is said to be. 

It should also be noted that tests of intelligence correlate nearly as 
highly with tests of school achievement as they do with each other. 
For example, the Stanford Achievement Test, a standardized exami- 
nation in such school subjects as reading, arithmetic, spelling, history 
information, and the like, yielded a correlation of .66 with the Na- 
tional Intelligence Test, .71 with the Illinois General Intelligence Test, 
and .79 with the Otis Intermediate Test of Intelligence (24, Ch. 17). 
I'hc correlations of diffcre it intelligence tests with each other run no 
higher than these, most of them falling between .60 and .80. 

All in all, it is apparent that most intelligence tests are heavily 
weighted with certain functions, predominantly verbal aptitude. At the 
same time, they have proved to be of considerable value as empirical 
instruments of prediction in a wide variety of practical situations. In 
forecasting academic promise, aiding in the selection of applicants for 
jobs, and assisting the vocational counselor, they are making a signifi- 
cant contribution. Their usefulness in the large-scale problems of 
selection and classification of military personnel in World War 11 was 
indisputable. That such tests have proved to have empirical validity 
suggests that the criteria themselves may be “overloaded” with certain 
aptitudes. If the tests were not overweighted with verbal ability, their 
validity might drop appreciably, since verbal aptitude undoubtedly 
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plays a predominant role in determining successful achievement in 
our schools, our vocational pursuits, and other everyday life situa- 
tions in our culture. 

A CULl URAL C ONCEPT OF INTELLIGENCE 

Among the many definitions of intelligence which have been proposed 
by psychologists,^^ two concepts recur most frequently. First, intelli- 
gence is characterized as the ability to deal with abstract symbols and 
relationships. Secondly, it is described as tpe capacity to adapt to new 
situations or to profit by experience and is virtually identified with 
learning ability. Most of these definitions sufTcr from the weakness 
that, in their effort to be all-encompassing, they really tell us very 
little. If, for example, we define intelligence as the capacity for ab- 
straction, we are immediately confronted with the fact that the same 
individual may deal effectively with abstract verbal concepts but be 
very deficient with quantitative concepts, or vice versa. Similarly, the 
available evidence olTers no support for the view that “learning'’ is a 
unitary function (cf., e.g., 52). If intelligence were to be dclined in 
terms of the ability to learn, a legitimate question would be, “To learn 
what?" ^ 

It is thus apparent that “intelligence" can be defined only w*' 
reference to a particular setting or environmental milieu. This viev 
point immediately suggests thdt there are, not one, but many defini- 
tions of intelligence. Within our cultural setting, intelligence appar- 
ently consists in large part of verbal ability. It will be recalled that the 
one field from which idiots savants arc conspicuously absent is the 
linguistic one. Success in the practical business of everyday life — for 
both child iind adult — is so closely linked with verbal aptitude that a 
serious deficiency in this respect will brand the individual as mentally 
incompetent. Conversely, the person who is especially proficient in 
verbal functions may thereby compensate for deficiencies along other 
lines and wall rarely, if ever, find his way into an institution for the 
feebleminded. No other single talent seems to be such a saving grace 
in our society. Because of its intimate association with our concept of 
“general intelligence," verbal aptitude does not ordinarily enter into 
our classification of special talents or defects. Children who are defi- 

For a survey of diflerent definilions of intelligence — early and recent — cf. IS, 
17, 33, 41, 42, 44, 51 . 
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cient in reading or verbal expression are usually inferior on intelligence 
tests (20). On the other hand, case studies of juvenile authors have 
invariably shown them to be children of very high 10 (21). To define 
intelligence within our culture is primarily to catalogue those activities 
which are made possible by linguistic development. 

In summary, the data of the present chapter suggest that the indi- 
vidual’s abilities in the verbal, numerical, spatial, musical, and artistic 
fields arc relatively independent of each other. Of these, the verbal 
and — to a lesser extent — the numerical aptitudes are most closely 
identified with the concept of “intelligence” in our culture. In the 
chapter which follows, we shall inquire more intensively into the 
iaentilication and interrelation of the psychological “traits” which 
have been suggested in the present chapter. 
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CHAPTER 

15 


In its most ellmkntary terms, a trait may be regarded as a cate- 
gory for the orderly description of the behavior oF individuals. The 
concept of trait is concerned with the organization and interrelation- 
ships of behavior. 'lYaits are therefore identified by observing or meas- 
uring varied behavior manifestations of the individual. Traits also 
refer, as a rule, to relatively enduring characteristics, which thus have 
some predictive value. Moreover, they usually cover those character- 
istics in which individuals dilTcr appreciably froin one another. Lastly, 
a cultural frame of reference is also evident, although not always 
staled, in most trait classifications. It is those aspects of behavior 
which are import anl within a particular culture (’•r environmental 
setting which are generally identified and described as traits.^ 

'Ihcories of trait organization are very old. As long as philosophers 
have discussed the nature of “mind,' they have proposed theories 
regarding the units into which the ""mind’’ was subdivided. With these 
speculations, however, we arc not concerned. Ft is only since the ap- 
plication of psychological tests and quantitative methods that the 
relationships among the varied behavior manifestations of the individ- 
ual could be measured. The more recent theories have been developed 
as interpretations of specific evidence and thus have a more empirical 
foundation. 

MA.IOR I’HEORICS 

The Two-Faclor Theory. The problem of trait organization was first 
placed upon an empirical basis with the publication of Spearman's 
1904 article (8<S ) in which were presented a theory and a new method 
of investigation. According to the original formulation of Spear- 

* Even ihc trait names in our language have a cultural origin and Li turn influence 
our selection and definition of trails. 
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man’s Two-Factor theory (89, 91), all intellectual activities have 
in common one fundamental function which is called the general 
factor, or g. In addition, each activity has specific, or s, factors. The 
A’ factors are considered to be exceedingly numerous and strictly spe- 
cific to each activity of the individual. No two activities can share 
specific factors, by definition. Spearman argued that such a theory is 
consistent with correlation results. Thus tlie presence of different 
specifics in every activity would explain the absence of perfect f- 1.00 
correlations: no two activities, however much they may depend upon 
the g factor, are entirely free from specifics. The fact that most 
abilities are positively correlated, on the other hand, is attributed to 
the ubiquitous g. Different proportions of g and .v in each activity 
Wv'uld produce a wide range of positive correlations, all higher than 
zero and lower than 1 .00. 

It follows from the Two-Factor theory that the aim of mental 
testing should be to measure the amount of each individuaPs g. If 
this factor runs through all abilities, it furnishes the only basis for 
prediction of the subject's performance from one situation to an- 
^.jther. It would be futile to measure specific factors, since each oper- 
ates in only one activity. Accordingly, Spearman proposed that a 
single test, highly “saturated” with g. be substituted for the hetero- 
geneous collection of items in intelligence scales. He suggested that 
tests dealing with abstract relations, such as the analogies test, arc 
probably the best single measures of g and c«.)uld therefore be em- 
ployed for this purpose, 

Fn regard to the nature of Spearman offered only tentative 
hypotheses. He proposed that g may be regarded as the general 
“mental energy'’ of the individual and the ,v factors as the “engines” 
through which it operates, or the specific neurone patterns involved 
in each activity. This interpretation of g and .v is not, however, an 
integral nor a basic part of the 1 wo-l-actor theory. It might be noted 
that Spearman's g would also furnish a basis for the popular notion 
of general intelligence. 

Even from the outset, Spearman realized that the Two-Factor 
theory must be qualified. When the activities compared are very 
similar, a certain degree of correlation may result over and above 
that attributable to the g factor. Thus in addition to the general and 
specific factors, there might be another, intermediate class of factors, 
not so universal as g nor so strictly specific as the v factors. Such a 



494 Differential Psychology 

factor, which is common to a group of activities but not to all, has 
been designated a group jactor. In the early formulation of his the- 
ory, Spearman admitted the possibility of very narrow and negligibly 
small group factors. Following subsequent investigations by several 
of his students, he included much broader group factors such as 
arithmetic, mechanical, and linguistic abilities. 

Finally, on the basis of a scries of studies, additional general fac- 
tors were suggested. These include p (perseveration), o (oscillation), 
and w (will), the last extending the theory to the field of personality 
traits. It was also proposed by Spearman, (cf. 89, 91) that whereas 
g represents the total amount of “mental energy” at the subject’s dis- 
posal, p may denote the inertia of such mental energy, and o the 
unsteadiness of its supply. Thus all the proposed general intellectual 
factors could be but different manifestations of the same g factor. 

In the later writings of Spearmam and his followers, the presence 
of all three classes of factors — general, group, and specific — is clearly 
recognized. The chief differentiating feature of the later form of the 
Two-Factor theory thus seems to be its relative emphasis upon the g 
factor as a more important influence than the group factors in pro- 
ducing correlation. It should also be noted that the distinction between 
general, group, and specific factors is probably not so fundamental 
as may at first appear. For example, if the number or variety of tests 
in a battery is small, a single “general" factor may account for all 
the correlations among them'. But when the same tests arc included 
in a larger battery witli a more heterogeneous collection of tests, the 
original general factor may now appear to be only a group factor, 
common to some but not all the tests. Similarly, a certain factor may 
have occurred in only one of the tests in the original small battery, 
but may be shared by several tests in the larger battery. Such a factor 
would have been identified as a specific in the original battery, but 
would become a group factor in the more comprehensive battery. 
It is probably more realistic to speak of group factors of varying 
extent, rather than of sharply differentiated general, group, and 
specific factors. 

The Sampling Theory, The Sampling theory cf trait organization 
has been most clearly and completely formulated by Thomson (99) 
and is described by him in a series of publications dating from the 
second decade of the present century. According to such a theory, 
behavior depends upon a very large number of independent elements, 
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which have been variously identified by Thomson and others (102, 
115) with genes, neural elements, stimulus-response bonds, specific 
experiences, or environmental characteristics. Any one activity of the 
individual, it is argued, depends upon a particular sample or com- 
bination of these elements. Correlation results from the overlapping 
of different samples of elements. Different types of factors may thus 
be produced, varying from the specific, through group factors of 
differing extent, to a very broad or general factor. Thomson has re- 
peatedly illustrated, with data from dice throws,^' how^ various factor 
patterns may occur from overlapping samples of independent elements. 

Improvement in an activity with practice, according to Thomson’s 
Sampling theory, is not due to improvement in the elementary abili- 
ties involved, but to the use of a more economical and efficient selec- 
tion of these abilities. As a practical illustration of this, Thomson 
cites the well-known dropping out of unnecessary movements in the 
learning of a motor skill. 

Other viewpoints which bear a fundamental resemblance to that 
of Thomson arc those expressed by E. L. Thorndike ( 100, 106) and 
Tryon (115). Thorndike’s views on trait relationships seem to have 
run the gamut from extreme specificity to the opposite extreme of a 
single general factor (cf. 100-106). Throughout his various state- 
ments, however, one can discern the conception of abilities as being 
ultimately reducible to a large number of simple associative bonds or 
connections, whose role in trait relationships appears to be quite 
similar to that of Thomson's elements. Tryon has expressed a similar 
view, in terms of the operation of a multitude of elementary psycho- 
logical components. The overlap among such elementary components 
produces the correlations between different functions. As further, 
although minor, sources of such co»*relations he mentions possible 
associations between environmental fields and between gene-blocks. 
By the former he refers, for example, to the fact that the individual 
in an inferior cultural milieu may lack many environmental oppor- 
tunities for developing both linguistic and computational skills. Cul- 
tural linkages would thus tend to produce a correlation between these 
two areas. Correlations between independent gene-blocks could occur 
through assortativc mating. Since individuals tend to marry within 
their own general socio-economic and educational level, persons 

^ Frequently employed in statistics as a means of obtaining puiely random or 
“chance” data. 
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superior in quite different respects are likely to interbreed. Their 
offspring would thus tend to receive genes for superior development 
in a number of initially unrelated characteristics. The same type of 
selection would operate in the interbreeding of persons of diverse 
inferiority. 

The Multiple-Factor Theory. The theory held by the largest num- 
ber of contemporary psychologists proposes a relatively small number 
of moderately broad group factors, each of which may enter with 
different weights into dilfcrent tests. For example, the verbal factor 
may enter with a high weight into a vocabulary test, with a smaller 
weight into an analogies test, and with a very small weight into an 
arithmetic reasoning test. Such theories have been variously desig- 
nated Multiple-Factor or Weighted Group-Factor theories.*' 

The publication in 1928 of Kelley’s Crossroads in the Mind of 
Man (63) paved the way for a large number of studies in quest of 
particular group factors. Kelley contended, after a critical analysis 
of the methodology and data of Spearman, that the general factor 
is of relatively minor importance and can usually be attributed to the 
heterogeneity * of the subjects and to the common verbal nature of 
the tests employed. If a residual general factor should bt^found when 
these influences are ruled out, Kelley maintained tiiat it would prob- 
ably be small and insignificant. The major relationships among tests 
he attributed to a relatively small number of broad group factors. 
Chief among these were manipulation of spatial relationships, facility 
with numbers, facility w^ith verbal material, memory, and mental speed. 
This list has been modified and extended by subsequent investigators, 
employing the more recent methods of factor analysis to be con- 
sidered in the following section. 

One of the leading exponents of the Multiple-Factor theory today 
is Thurstone (108-1 14). On the basis of extensive research by him- 
self and his students, Thurstone has proposed about a dozen group 
factors w'hich he designates “primary abilities.” Iliose most fre- 
quently corroborated in the work of Thurstone and of other inde- 
pendent investigators include the following: ^ 

^ For a relatively eaily but clear exposition of the operation of wei^ihted group 
factors, cf. 62, pp. 195 226. 

The influence of heterogeneity upon correlation coefficients wil! be discussed in 
the following section. 

^ For some of the most iclevant investigations dealing with these factors, as well 
as for general summaries, cf. 3, 4, 31, 36, 42, 51, 52, *^3, 83, 108, 111. 114, 126. 
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V: verbal comprehension — the principal factor in such tests as read- 
ing, verbal analogies, disarranged sentences, verbal reasoning, and 
proverb matching. It is most adequately measured by vocabulary 
tests 

W : word fluency — found in such tests as anagrams, rhyming, or nam- 
ing words in a given category (c.g., boys’ names or words with the 
same initial Icttcrl. I'his factor has been identified in relatively few 
investigations. 

N: number — most closely identified with speed and accuracy of sim- 
ple arithmetic computations. 

S: space — it is possible that this factor may represent two distinct 

factors, one covering the perception of fixed spatial or geometric 
relations, and the other ‘"manipulatory visualization” in which 
changed positions oi transformations must be visualized (51). 

A/: associative nicniory — found principally in tests demanding rote 
memory tor paired associates. 'I he evidence is against the presence 
of a broader factor through all memory tests (3, 4, 5 1 , 1 t)S ) . Other 
restricted memory factors through narrowly defined groups of 
tests have been suggested by some investigations (51), 

f\‘ perceptual speed — quick and accurate grasping ol visual details, 
similarities, and difl'erences. This factor may be identical with the 
“speed factor” identihed by earlier investigators (cf. 36, 126) and 
described as “speed in dealing with very easy material.” It may 
also be restricted to visually picsentcd material (51). 

/ (or R): induction (or general reasoning) — the identification of this 
factor is probably least clear. Thurstone originally proposed an 
inductive and a deductive factor (108). The latter was best meas- 
ured by tests of syllogistic reasoning and the former by tests re- 
quiring the subject to find a rule, as in a number series completion 
test. Rvidence for the deductive factor, howe\cr, was much more 
tentative than for the inductiv- Moreover, other investigators sug- 
gest a general reasoning lacto', illustrated by such tests as arith- 
metic reasoning, and fail to corroborate the distinction between 
inductive and deductive reasoning factors (51). 

Rapprochement. The problem of tiait organization has been one 
of the most controversial in psychology. In the 1920's and 1930’s 
the journals fairly bristled with ci/dques, replies, rejoinders, and 
counter-rejoinders. The exponents of each point of view were clearly 
aligned on one side or another. With the gradual sharpening and 
clarification of concepts and the steady accumulation of relevant data, 
a rapprochement has been slowly but unmistakably occurring among 



498 Differential Psychology 


these points of view. It has already been noted above that the dis- 
tinction between the Two-Factor and the Multiple-Factor theories 
today is but one of emphasis and degree. The three types of factors — 
general, group, and specific — are not sharply dill'crentiated, but prob- 
ably represent a continuum of factors of varying breadth. 

A less obvious but equally fundamental convergence has occurred 
between the Multiple-Factor and the Sampling theories. On the one 
hand, multiple-factor exponents have agreed that their factors, rather 
than being unitary ultimatcs, may well represent aggregates of more 
elementary units akin to the elements of the Sampling theory. The 
discovery of a “verbal factor” might thus mean simply that a par- 
ticular combination of response elements, all dealing with verbal 
material, was discernible. The term '"funciional unity,"'' recently pro- 
posed by Thurstonc (109) for such an aggregate, typifies tnis point 
of view. Its connotations appear to be essentially the same as those 
of Tryon's concept of "'operational unities"'' among the elementary 
psychological components (116). Thomson, too, beginning from the 
opposite extreme, has drawn closer to the muitiple-lactor view (99). 
Although he originally assumed the sampling of elements by different 
functions to be completely random, he subsequently pK^posed that 
the elements are organized into fairly enduring '\siib-poois of the 
mind"" These sub-pools into which the elements arc structured or 
organized would acccnint for the correlations within each area, such 
as the verbal, numerical, or spatial. With the “primary abilities” of 
the Multiple-Factor theory broken down into functional or opera- 
tional unities consisting of numerous uncorrclatcd elements, and with 
the elements of the Sampling theory organized into sub-pools, the 
rapprochement appears to be virtually complete. 

In keeping with these revised concepts, too, is the point of view 
which Burt has been championing for many years (8, 19, 20). Fac- 
tors, according to Burt, should be regarded as principles of classi- 
fication or descriptive categories rather than as causal entities.^’ The 
factors cannot explain the test scores. Rather can it be said that the 
test scores account for the factors, since the latter arc derived from 
the test scores. Essentially, factors represent a summary and simpli- 
fied statement of the information contained in test scores, and therein 
lies their practical value. The function of statistically derived factors 
as a simplification of behavior descriptions will be more clearly appar- 

® For another expression of the same point of view, cf. 7. 
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ent when we consider techniques of factor analysis in the follow- 
ing section. 

METHODOLOGY 

The Tetrad Criterion. Fundamentally, all techniques for the study 
of trait organization are based upon the correlation coeflicient. This 
measure indicates the degree of relationship between two sets of 
scores, or the extent to which each individual’s performance on one 
test corresponds to his performance on another test. Correlation, 
however, cannot analyze the mutual interrelationships of a large num- 
ber of variables. A correlation coefficient may indicate whether there 
IS some factor common to a pair of tests, but it cannot show^ the 
presence of a single common factor through three, or four, or any 
larger number of tests. Let us suppose that all intcrcorrelations among 
three tests have been computed, with the following results: ‘ 

r^o -- .60 

*49 

r “ -^O 

Although all three correlations are positive and rather high, we can- 
not determine whether these three tests have one common factor or 
several common factors among them. Test 1 might share one factor 
(A) with test 2, and a different factor (B) with test 3; a still different 
factor (C) might constitute the common clement between tests 2 
and 3. 

It was Spearman (8P^ vdio first denumstrated that from the 
relationships among coireiatiim coefficietiis it is possible to discover 
the factorial organization of any mimber of tests. The first method 
proposed by Spearman was the hierarchical arrangement of correlation 
coelficients. According to this criterion, if it w^as possible to arrange 
all the iiitercorrclations among a set of tests in such a way that they 
decreased consistently in size botli along the rows and along the 
columns of the table, then the relationships among these tests could 
be explained entirely in terms of g and s. This was a relatively crude 
‘‘inspcctional” method of determining hierarchy. Subsequently, the 
computation of the intercolnmnar correlaiion was suggested as a con- 
venient numerical index of hierarchy. The intercolnmnar correlation 

ll is customary to denote the partiLisUir vaiiablos correlated by subscripts. 
Thus, ii> is the correlation between test No. 1 and test No, Z. 
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is simply the correlation between columns of correlation coefficients. 
A -f 1.00 intcrcolumnar correlation would indicate a perfect hier- 
archical arrangement of the coefficients. 

Finally, the intercolumnar correlation was replaced by the tetrad 
criterion. The latter gets its name from the fact that the tests arc 
considered in sets of four. For every four tests, or variables, we can 
compute three tetrad equations as follows: 

ti-ii - — ri:iXrii4 

» i Xr-j 

ti.ui: - ri;; X 

Spearman and others have been able to prove mathematically that 
if all three tetrad equations arc equal to then a single common 

factor is sufficient to account for the relationships among the lour 
variables. 

This was a decided step forward from the simple correlation coeffi- 
cient. It was now possible to analyze the interrelationships of any 
number of variables by computing dilTerent sets of tetrads. The 
extension of the tetrad criterion beyond four variables can easily be 
demonstrated. Let us suppose that wc have administered six tests to 
the same subjects. First, w^e compute the three tetrad equations with 
tests 1, 2, 3, and 4. If all three tetrads are equal to zero, we may 
conclude that the same factor which underlies tests 1 and 2 is also 
common to tests 3 and 4. llicn if the tetrad criterion is likewise satis- 
fied (i.e., all tetrads equal to zero) with tests 1, 2. 5, and 6, we 
know that the factor common to 1 and 2 is identical with that com- 
mon to 5 and 6. Hence the same factor must be common to all six 
variables. 

The development of the tetrad criterion stimul. ued extensive re- 
search on trait organization and was imdoubicdly an important step 
forward in the statistical study of trait relationships, its usefulness, 
however, is iimi<ed. One pra.ctical drawback is tliat, as the number 
of tests increases, the number of tetrads to be com|_)uted becomes 
excessively large. With only 10 tests, for example, there are 630 
tetrads. With 60 tests — the number employed in one of Thiirstoncks 
studies — over a million tetrads would have to be computed, d'he 
technique thus becomes unwieldy in the analysis of a large number 
of tests. Moreover, the tetrad tcchnkiuc docs not piovidc a clear 

^ Or arc sufficiently close t(» zero to be within the error of sampling. 
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over-all picture of the location of group factors, nor does it indicate 
the weight with which these factors enter into each test. With the 
shift in emphasis from a single general factor to a number of group 
factors, the tetrad criterion has been largely supplanted by more 
suitable and expeditious methods of factor analysis. 

Factor Analysis. Several alternative procedures for analyzing a 
set of test scores into their constituent factors have been developed 
by Kelley (64), Hotelling (61), Burt (19), Holzinger (59), 1'ryon 
(116), Thurstone (112), and others.'^ Although differing in their 
initial postulates, most of these methods yield results which arc not 
too unlike. The most widely used technique is the centroid method 
developed by Thurstone ( 1 12). In common with all the other meth- 
ods, this technique begins with a complete table of all the inter- 
correlations among a set of tests. Such a table is known as a correla- 
tion matrix. In the process of factor analysis, tliis correlation matrix 
is transformed into a jacii>r matrix, giving the weight or loading of 
each factor in each lest. 

An example of such a factor matrix will be found in Tabic 28. 
This matrix was derived from the intcrcorrelations of 21 tests given 
to 437 seventh and eighth gr^dc school children tll4). The seven 
factors listed at the top of the table are the same as those described 
on page 497 and arc indicated by the same letters. 4'he entries in 
the body of the table show the loading of each test with each of these 
factors. Tor example. Test 1, identual niitnbers, has signiticant load- 
ings of .42 and .40 with the perceptual speed and number factors, 
respectively; Test 8, vocabulary, has a high loading of .66 with the 
verbal comprehension factor and negligible loadings with all the 
other factors; Test 16, addition, has a single significant loading of 
.64 with the number factor. All the lesiduals, given in the last column 
of the table, arc small, indicating that substantially all the correla- 
tion among the tests can be accounted for by the seven facU^rs shown. 

Rotation of Axes. The ractor weights found by the centroid 
method represent a ‘T’enter of gravity"” or a sort of average value 
based on all the correlations. As each faetor is extracted, the residual 
correlations are subjected to the Sc me type of analysis in order to 
obtain the weights of the next centroid factor. Thurstone ( 109, 112) 

For a survey of the assumplions underlying the various methods and a clear 
introduction to techniques of factor analys;s, cf. WoUlc (126) and Guilford t4S, 

Ch. xrv). 
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has argued that the factors thus located do not usually correspond 
to meaningful categories, and he therefore advocates the rotation of 
axes following the centroid factor analysis. The factorial matrix 
reproduced in Table 28 has already been nnated. 



“I 


Original Centroid Axes I, H Rotated Orthogonal Axes. I,' II' 

Fig, 83. Rotation of Axes: Orthogonal, (From Garrett, 42, p. 261.) 

Factors can be visualized geoinetricallv as axes in terms of which 
each test can be plotted or described. Retercncc to Figure 83 will 
make this interpretation of factors i. 'irer. For simplicity of illustra- 
tion, only two factors, or axes, are included in this figure. Three 
factors would require a three-dimensional representation, and four 
or more factors could not, of course, be represented directly in geo- 


Either by the ccnlioid method or by other similar techniques of factor analysis. 
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mclrical space, although they can be handled conceptually and mathe- 
matically by an extension of the same principles. In Figure 83, the 
factor loadings of the following nine tests have been plotted with 
reference to the two centroid axes, 1 and II: 

1. Vocabulary 6. Arithmetic Reasoning 

2. Opposites 7. Number Series Completion 

3. Analogies 8. Equation Relations 

4. Sentence Completion 9. Multiplication 

5. Disarranged Sentences 

The factor loadings were found from the intercorrclations of the 
scores of 210 college men on these nine tests (83, 42). Reference to 
Figure 83 will show, for example, that Test 1, voccihuhiry , has a 
loading of .776 with Factor I (distance along horizontal axis) and 
a loading of .479 with Factor II (distance along vertical axis). Thus 
by plotting each test we have “’described” and located it with refer- 
ence to the two centroid axes. 

It wall be noted that nearly every test in Figure 83 has an appre- 
ciable loading with both ftictors and that several negative weights 
are present. A number of psychologists regard negative loadings as 
“psychologically meaningless/’ because such loadings imply that 
the higher an individual rates in the particular factor, the ptn^rer will 
be his performance on the test. Centioid factors are also dilTicult to 
describe in psychological Ic^rms. The two centroid axes in Figure 83, 
for example, do not lie close to any cluster of tests through v/hich 
the factors might be identified. It was as a solution for these difficul- 
ties that 1'hurstone proposed the rotation of axes after factoriza- 
tion (109). 

It should be noted that a given set of tests can be described in 
terms ot an infinite number of coordinate systems or axes. I'he situa- 
tion is not unlike that of locating points in geography ( 125). Longi- 
tude ciuilcl be measured from Chicago instead of Cireenwich as a 
point of origin, and latitude could be measured from the North Pole 
instead of the Equator. In factor analysis, the choice of axes depends 
largely upon the objectives of the analysis. Thurstonc has proposed 
what he terms "\siniple structure'''' as a criterion for locating the most 
suitable axes for the analysis of psychological tests. By this he means: 

At least in the measurement of abilities: the possibility of genuine negative 
loadings seems more acceptable in connection with personality traits. 
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(a) eliminating negative factor loadings as completely as possible, 
and (b) maximizing the number of zero (or near-zero) factor load- 
ings. The latter condition implies that each test shall be described 
by the smallest possible number of factors, since a zero loading simply 
means that the factor does not enter into that particular test. Thurs- 
tone maintains that when the axes arc rotated in such a way as to 
maximize the zero loadings, most negative loadings also disappear 
and the resulting factors are relatively easy to interpret or identify 
in psychological terms. Referring back to Figure tS3, we may note 
that the rotated axes V and IF, which fulfill the conditions of simple 
structure fairly closely, seem to correspond to the familiar V and 
N factors. Rotated axis F runs close to a cluster of verbal tests (Tests 



Original Centroid Axes. I, 11 Rotated Oblique Axes: 1/ H' 

Fig. 84. Rotation of Axes: Oblique. (From Garrett, 42, p. 262.) 
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1 to 5), while axis ir runs close to the numerical tests (Tests 
6 to 9). 

It will be observed that in Figure 83 both axes were rotated over 
an angle of 35°. The two axes thus remained at right angles to each 
other, or orthogonal. An even closer approximation to simple struc- 
ture can be reached ’f each axis is allowed to rotate independently of 
the other, as shown in Figure 84. In this case, the new axis T is set 
at right angles to the line P2, which runs through the cluster of 
numerical tests. Thus the numerical tests have close to zero loadings 
in Factor I'. Similarly, the new axis IF is set at right angles to the 
line PI through the verbal cluster. The two new reference axes, 1/ 
and IF, arc not orthogonal to each other, but represent oblique axes. 
This signifies that the two factors which have been identified arc 
themselves correlated. In the example illustrated in Figure 84, the 
V and N factors were correlated to the extent of .225. 

There is an increasing recognition of the fact that oblique axes, or 
correlated factors, may be just as useful as orthogonal axes in a 
systematic deserij>tion of behavior. Logically, there is no reason why 
the primary categories of behavior must be uncorrelatcct. In physical 
measurement, for example, height and weight have cloftrly demon- 
strated their usefulness as bodily dimensions, despite the fact that 
they are certainly correlated ( 109), 

One last point should be noted regarding the rotation of axes. If 
the factors themselves are correlated, then it should be possible to 
“factorize the factors” and k>cate second-order jaciors. This has 
been done in an analysis of the scores of 710 eighth grade children 
on 60 tests ( 1 14). A single second-order general factor was idenlihcd 
which seemed to be fairly similar to Speannan’s g. Such a finding 
should perhaps be viewed as further evidence of rapprochement 
among the various theories described in the preceding section. 

Some Limitations of Factor Analysis. In interpreting the results 
of factor analysis, certain major limitations of these techniques should 
be considered. First, any set of intcrcorrclations can be analyzed into 
innumerable factor patterns. To demonstrate the presence of certain 
factors simply shows that the tests can be described in terms of those 
factors, not that they must. An analysis into other factor patterns is 
not precluded. To illustrate this point, Spearman (90) cited the 
analogy with a rectangle. Such a figure can be divided into two tri- 
angles, but it can also be divided in an infinite number of other ways. 



Trait Organization 507 


How we divide a rectangle or how we factorize a test batter> depends 
upon the nature of the problem. The various methods ot factor 
analysis differ m the limiting conditions which they impose m order 
to reach a determinate solution. The appropriateness of these con- 
ditions to the problem under investigation should determine the 
choice of method 

Secondly, since all techniques of factor analysis begin with inter- 
corrclations, it is obvious that any circumstances which atlect the 
correlation coefficient will also affect the results ot the factor analysis. 
It has been repeatedly demonstrated, both empiricallv and theo- 
retically,^- that the size oi a correlation cocthcient is affected by the 
hcteioi’cncity of the gioup ol subjects upon whom the d<ita were 
collected 1 he most obvious example is that ol age heterogeneity. 
Jf the subjects range m age from 3 to 15 years, a high positive cor- 
lelation will be loimd between even sueh diverse charac tciistics as 
size of the great toe and Stanfoid-Binet mental age Ihc same two 
measures would yield a zero correlation within a homogeneous age 
group such as, foi txample, 1()->ear-old children. 

Nor docs heterogeneity always raise the correlation coefficient; it 
may lower it let us suppose, for example, that a group ot high 
school boys and a group ot high school girls have eaeh taken two 
memory tests, one based on a sports story, the other on a lashion 
stoiv I ct us further suppose that the correlalion between the two 
memorv tests is 40 among the bo\s and also .40 among the girls 
Ihc girls as a group, howcxei, will piobably s^eue higher than the 
boss on the fashion test Uiilc the boys will excel on the sports test. 
If, now, we compute a single coi relation between these two tests in 
the combined group ol boys and girls, the resulting coefficient will 
be much lower than 4(^ Ihc gre iter the sex chffcTcnec on the two 
tests, the more will me correlation be loweied by combining the 
two groups 

It is likewise possible foi heterogeneity to produce a negative 
coi relation between two variables which ..re actually uncorrclated 
Thus, it a heterogeneous group composed ot Chinese and Scandi- 
naxians were fated lor height and 1» i prohcicncy m the use ol chop- 
sticks, a fairly high negative correlation would be obtained between 
these two measures. ^ he C hmesc would, in general, be shorter than 
the Scaniinavians and dctmiicly more adroit with chopsticks Within 

’-Cl eg "^2 44 
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either group, however, we should scarcely expect any correlation 
between these two characteristics. 

Correlations which result from a marked degree of heterogeneity 
in the group arc usually regarded as spurious correlations. It is dif- 
ficult to decide, however, just what constitutes a permissible degree 
of heterogeneity. Obviously, all heterogeneity should not be elimi- 
nated, even if this were possible, since individual diflerences would 
thereby disappear and correlation would be meaningless. The desired 
degree of heterogeneity must be determined on the basis of the par- 
ticular problem under investigation. It shciuld always be remembered, 
however, that correlation cocOicienls, or any statistical measures 
derived from them, must be interpreted in the light of the particular 
group upon which they were obtained. 

APF'LIC ATIONS OF FAC TOR ANALYSIS 

Special Areas of Ability. Factor analysis began as a technique 
for studying Ihc organization c^f intelligence. As a method, however, 
it is proving to be applicable to an increasing number of widely 
diverse questions. Intensive studies of special areas of ability have 
been under way to supplement the broad surveys of the earlier in- 
vestigations. Studies of learning among adult subjects, for example, 
have so far failed to reveal a general learning factor (127, 128). 
Gains tend to be specific: the same individual may be a relatively 
rapid or good learner in one task and a slow or poor learner in 
another. 

A factor analysis prr('e/)tion, with both paper-and-pencil and 
laboratory tests, disclosed several significant factors within this area 
(110). Among them were reaction time, speed of perception, speed 
of judgment, rate of reversals in ambiguous figures, speed of closure, 
and flexibility of closure.'’' I'he last two arc of special interest because 
of their suggested association with certain intellectual and personality 
characteristics. For example, a survey of a group of administrators 
in Washington showed that the more successful administrators scored 
relatively well on some of the closure tests ( 1 10). A tentative hypoth- 
esis, suggested to account for these results, was that the successful 

^'^“Closuie” IS a term intfoduccU in Ihc Gcslalt sUidics on perception lo letcr to 
the well-known perceptual (illin« of incomplete fij’uies. Thu.s n sketch ol a house will 
be perceived as a ct^mplclc liouse even thoiijdi the drawing itself ina> contain 
many gap^. 
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administrator is an individual who can most readily unify the appar- 
ently unrelated elements in the work which he must coordinate. That 
the previously identified perceptual speed factor (P) can itself be 
broken down into a number of subsidiary factors has been demon- 
strated in more than one study (13, 110). 

In a special study of verbal tests, the previously identified factors 
of verbal comprehension (V) and word fiuene> (W ) were further 
subdivided into three and tw^o factors, respectively, and three new 
verbal factors were isolated (24). Thus eight new factors of more 
restricted scope than the original V and W were identified in verbal 
tasks. Examples of these factors include the individuaTs stock of 
linguistic responses, facility and fluency in oral speech, and speed of 
articulatory movements. A factorial analysis of fluency in writing 
(95) also suggested new factors, such as “ideational fluency’’ and 
“verbal versatility.” Intensive factorial investigations of reasoning 
tests have likewise suggested the presence of several uneorrelated 
reasoning factors in such tests (34; 119, No. 5). Whether any one 
of these factors is common to all reasoning tests is still a moot point. 
Attempts to identify a factor of flexibility, or the ease with which the 
individual can shift from one task to another, have so far failed to 
disclose any such factor; the subject’s ability to shift seems to depend 
entirely upon the specific content of the tests (65). 

INon-lntcIlectual Functions. Some application of factor analysis 
has also been made to non-intollcctual functions. A number of inves- 
tigations have been concerned with motor functions, including both 
manipulatory skills and athletic proficiency ( 16, 21, 38, 66, 86, 87). 
Such functions have on the whole proved to be highly specific, the 
intercorrclations among dificrenl motor tests usually being quite low. 
Certain relatively narrow factors have, henvever, been identified. 
Steadiness tests, for example, have repeatedly shown a common fac- 
tor. There is also some evidence for group factors underlying im- 
provement in motor functions, speed of isolated reactions, finger and 
hand speed in restricted oscillatory mo^^ments, and forearm and 
hand speed in restricted oscillatory movements. In general, the factors 
most commonly found in motor fimetions seem to be related not to 
the specific muscle groups, body parts, or sensory modalities involved, 
but rather to similarities in pattern or type of movement (86). 
Finally, it should be noted that the more complex motor tests often 
contain the previousl} identified space factor {S). Moreover, several 
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analyses of common mechanical aptitude batteries show them to 
involve principally the space factor (5), perceptual speed (P), and 
various motor factors (47, 58, 125). 

Factorial studies of sensory tests have also been undertaken, in- 
cluding visual acuity and other widely used measures of visual 
efficiency (118, 1241. One finding of practical significance in this 
connection is that different visual tests designed to measure the same 
characteristics may have different factorial compositions. As a result, 
such tests will not be equally valid for different purposes. Among 
non-intellectual functions, mention should also be made of the rap- 
idly growing application of factorial methods to pctsonality measure- 
ment. Some of the problems and results in this area will be considered 
in a subsequent section, but it might be noted that factor studies have 
been contributing to the breakdown of the traditional demarcation 
between personality and intelligence. The same test may measure 
factors in both categories. Nor are personality and ability variables 
wholly unrelated within the individual. There is some evidence, for 
exmnple, which suggests a relationship between ability in drawing 
mm certain personality characteristics (27). 

Academic and Vocational Areas. A further application of fac- 
torial techniques is icprescnted by the analysis ol perlonnance in 
various academic and \ocational areas. This use of factor analysis 
has important practical iniplipations for personnel selection and coun- 
seling, since it places the construction of aptitude test batteries on a 
firmer and more systematic foundation. Let us suppose that we want 
to disco\cr what factors aic involved in successful performance in 
courses in elementary French or calculus, or in the occupations of 
filing clerk, cabinet maker, or city editor. The procedure would be 
to a'^semble a trial battery of tests sampling all the major factors and 
then to factorize the criterion measure along with the test battery. 
Thus final grades and achievement test scores in French or calculus, 
or follow-up records of job performance, would be included as one 
variable in the correlation matrix. 1 he factor matrix wall then show 
the loading of the criterion with each factor. For example, if variable 
1 in Table 28 had been a criterion measure rather than a test score, 
we could find the contribution of each of the seven factors to this 
criterion by simply reading across the first row of the table. The next 
step would be to choose those tests in the battery which arc most 
heavily weighted with the factors that predominate in the criterion. 
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These are the tests that will be most effective in predicting successful 
performance in the educational course or occupation under con- 
sideration. 

7’his type of analysis has been conducted with performance in such 
scholastic areas as algebra (15, 78), geometry (60), and technical 
courses (35). A good example of its use in military psychology is 
provided by the factorial analysis of pilot performance conducted by 
the psychological staff of the AAF (37, 51, 52, 53, 119). In this 
project, nearly 30 factors were identified, covering abilities, interests, 
emotional characteristics, and educational and other background vari- 
ables. Factor analysis likewise constituted the basic technique followed 
by the United States Employment Service in devising its General 
Aptitude Test Battery (40, 92). Preliminary batteries consisting of 
15 to 29 tests were administered to nine groups totaling 2156 men 
between 17 and 39 years of age. Most of the men were trainees in 
vocational courses. I'he 10 factors idcntiiicd most clearly and incor- 
porated into the U.S.E.S. battery have already been cited in the pre- 
ceding chapter. They include: 


G — general intelligence 
V — verbal ability 
N — numerical ability 
S — spatial ability 
P — 1 orm perception 


Q — clerical perception 
A — aiming 
7 - - motor speed 
F — linger dexterity 
M — manual dexterity 


Miscellaneous Applications. The possible uses of factorial tech- 
niques in psychology and related fields are many and varied. The 
factorial analysis of bodilv dimensions in the study of constitutional 
types has already been mentioned in Chapter 13. Other proposed 
applications range from the classification of psychiatric syndromes 
and the simplification of scratch b.sts for allergy to the analysis of 
voting records, Supreme Court decisions, and stock market fluctua- 
tions (109, 113). In most of these areas, exploratory research has 
already begun. Factor analysis is also being currently employed as a 
technique for simplifying “job evaluation” systems in business and 
industry (67). 

In certain applications of factor a.icjlysis, an adaptation known as 
obverse or inverted factor technique has been employed.^ ’ This simply 


Burt has pointed out that, strictly speaking, this technique is based upon a 
transpose rather than upon the imetse of the usual matrix of measurements, since 
the rows aie written os columns (19, p. Ie9). For u discussion of the technique, 
cf. 19, C'h. VI; 93. 
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means that the original correlations are correlations between persons 
rather than between tests. Thus all the scores of individual A on, let 
us say, 30 tests are correlated with the scores of individual B on the 
same 30 tests, yielding rAu. Similar correlations are found for every 
other pair of individuals in the group. These intercorrelations then 
form the basis for a factor analysis by any of the usual techniques. 
Inverted factor analysis has been proposed especially as a means of 
investigating personality types, since the “group factors for persons” 
would then represent “type factors” or patterns of traits shared by 
certain individuals. In some situations, as when an extensive series 
of measures is available on a relatively small number of persons, 
inverted factor technique may be preferable. It docs not seem, how- 
ever, that the two approaches should be regarded as fundamentally 
different. Substantially the same factors would probably be found 
by either approach (19). 

GROUP DIFFFRKNCES IN FACTOR PATTERNS 

With the extension of factor studies to subjects differing in age, sex, 
education, occupational background, and other characteristics, certain 
consistent group differences have come to light. What at first ap- 
peared as a source of confusion and controversy is now gradually 
falling into a systematic picture. Through the comparison of factor 
patterns in diverse groups, inorcover, we may learn something about 
the nature of trail relationships and how traits develop. As early as 
1927, Spearman called attention to such group differences, stating, 
"‘Another important inffuence upon the saturation of an ability with 
g appears to be the class of person at issue” ((S9, p, 217). At that 
time he also reported some data suggesting that among older as well 
as among brighter individuals, abilities are more specialized and the 
general factor plays a relatively smaller part. It is interesting to note 
that a large number of the studies by Spearman and his students were 
conducted on school children, a fact which may partly account for 
the insistence of these investigators upon the importance of the g 
factor. Most of the early studies by the American group-factorists, 
on the other hand, were concerned with college students. The latter 
found little or no evidence of a genera! factor, and put the major 
emphasis upon a few broad group factors. 

^■■"For a more detailed survey of these di/Terenccs, cf. 9. 



T rait Organization 5 1 3 


Age. A number of independent investigations are now available 
which indicate that abilities do in fact become more specialized as 
the child grows older (43). Among preschool children, the general 
factor appears to be relatively large, and group factors less important. 
For example, in a study on 200 5- and 6-ycar-old children (17, 42), 
various memory tests were as closely related to a vocabulary test 
and to Stanford-Binct MA as they were to each other. In contrast 
to this, at the college level simple tests of associative memory present 
a distinct group factor wliich breaks olf sharply from V, N, and other 
group factors (3, 4, lOS). Thus the correlation between vocabulary 
and the entire memory battery was .43 among the preschool chil- 
dren, but only .06 among the college students (43). 

Similarly, in a re-analysis of two different studies (82, 83), Garrett 
(42) found a correlation of .83 between the V and N factors in a 
group of third and fourth grade school children, in contrast to a 
correlation of only .23 among college students. In the Thur stones’ 
extensive study of 710 eighth grade school children with 60 tests 
(1 14), much higher correlations were found among the group factors 
than had been found in the earlier study on college students by one 
of the authors (1 08). For example, the N factor correlated .33 with 
word fluency; 1 correlated .43 with S and .42 with V. The correla- 
tion of V with W was .42 and with S .38. In the college sampling, 
ail factorial correlations were negligible, the median correlation being 
.03 and the highest .24 (108, p. 100). In the eighth grade sampling, 
furthermore, a second-order general faclor was identified whose cor- 
iclations with the first-ord ^ group factors ranged from .14 (with M) 
to .72 (with F). 

A few studies have been specifically designed to discover the role 
of age in trait relationships, in on*" of these (43), three groups of 
school children, aged 12. and 13, respectively, were given tests 
of memory, verbal, numerical, and spatial aptitudes, and motor speed. 
The intcrcorrelalions among these tests tended to decrease from the 
youngest to the oldest group. Factor pattern analyses revealed a gen- 
eral factor whose average contribution to the total battery dropped 
with age. Among the boys, the average per cent contribution of the 
general factor was 31, 32, and 12 for ages 9, 12, and 13, respec- 
tively. For the girls, the corresponding per cents were 31, 24, and 19. 
These results were corroborated in a later study using the Thurstonc 
Tests of Primary Mental Abilities with 1 1-, 13-, and 13-ycar-old 
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boys. The latter study confirmed Thurstone’s findings of a second- 
order general factor, and indicated that the influence of this factor 
drops with age (29). In still another study (11), a single group of 
children was retested, the average age at the two testings being 9 
and J2. Light tests covering verbal, numerical, and spatial content 
were administered. Tntercorrclations dropped from the first to the 
second testing, the decrease being larger in the correlations between 
verbal and numerical tests than those within either group. Factor 
pattern analyses corroborated the findings of other studies: a large 
general factor was found at both age levols, but its magnitude dropped 
from age 9 to age 12. 

The standardization data of the Wechslcr-Bellcvue Intelligence 
Scale provide some information regarding age changes in factor pat- 
tern among adults (12). The average intcrcorrclalion of the sub- 
tests in this scale dropped steadily from the 9-year-old group to the 
2.S-29-year-old group, thus corroborating the results of other studies. 
In the 35--44-ycar group, however, it rose to .31, and in the 50-59- 
year group it rose again to .43. Factor analyses showed evidence of 
a predominant general factor in the 9-ycar group and again in 
the age group 50-59, while in the intervening ages group factors 
played the major part. Thus in this study, specialization seemed to 
reach a peak during the middle age levels; in both the younger and 
the older groups, generalization of ability seemed to be the rule. 

One of the first hypotheses to account for age changes in factor 
patterns was that proposed by Kelley (63). The presence of a general 
factor in childhood, according to this explanation, results from indi- 
vidual dilfcrenecs in rate of intellectual inaluration. Thus the child 
whose mental development is sknver would have relatively low scores 
on all the tests, while the faster developing child would have higlier 
scores throughout. By an extension of the same hypothesis, tlie in- 
creasing weight of the general factor beyond maturity might be attrib- 
uted to individual diflercnces in the rale of mental decline. One 
objection to such a hypothesis comes from our present knowledge 
of intellectual growth. It will be recalled from Chapter 9 that dif- 
ferent functions arc quite independent in their development, and that 
it is unlikely that the individual is characterized by a general “rate 
of growth” or “rate of deterioration” for all abilities. Moreover, age 
changes need not //up/y maturational processes. We cannot assume 
that the same changes would occur regardless of what individuals did 
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during those years. That the latter is in fact important is suggested 
by some of the data to be considered in the lollowing sections 

Education. In discussinc; some of the later developments of his 
Sampling theory, Thomson wrote “ . a general tendene> is 
noticeable in experimental reports to the eftc-cl that batteries do not 
permit of being explained bv as small a numbei ol factors m adults 
as in children, probtibly because in adults education and vocation 
have imposed a structure on the mind which i** absent in the 
young . . . Some ol this \tiuetuie is no doubt innate, but moie c^f 
It IS probablv due to environment and edueatiiui and life’ (^9, 
pp 306, 319) Whit IS the eviaencc 1oi the influence ot educa- 
tion m this increasing ‘ stiuetuniiL ol abilities from childhood to 
matiiritv^ 

I irst it should be noteci that in all the studic s on school children 
and college stuc^eiit^ discussed above tin (>ldcr i'loup^ m utiahly 
had niou education Thus the IS v^tr-olds have had more education 
than the 9->cai-oids, and tiu college students ot course have had 
more than anv other eioup I ven more couent is the fact that, in the 
Wcehslcr Bellevue data changes in 1 letor patterns among older 
prisons closely parilldcd education il diflcrtnccs The 2S 29->car 
gioiip, showing the greater ‘'pLcnh/ iiion ol ability, iilso hid the 
highest cdue ition, with a range Ironi one to tour >( ai^ of high school. 
The 44 vlji gioup, whiv^h unged m education fiom the sixth 
grade to the hist ycai cd high school, shc'iwcd less spcL^iali/ation cT 
ability The oldest group with the le ist speeiali/aiion, ranged in 
education irom the fifth th^ eighth gi«idc An> ot these changes 
or group chncienccs in Ir nt oigim/ition cc uld thus be explained 
equilly well in terms of cdncilion t age As long is both variables 
are present, we cannot ehoo c bet ecu them on the basis of such 
data alone 

What IS needed is a compaiison between diflerent age groups of 
the same education or between dilTcrent educational gioups of the 
sm^e age Some lelevant prelimmai) data oi this sort were provided 
bv the armv testing in Woild War IT Within a sampling of 3000 men, 
eaiefully chosen so as to be lepresen ative c'lf the entire army, intcr- 
corrslations were computed ameng the sub-test scores of the AGCF 
(Form 3a) and of tfic \rm} Mechanical Aptitude Test (77) The 
avciage age of this group was 27, and their average educatum 91/2 
grades In Table 29 will he found the mtcreorrelations of the parts 
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of the AGCT. It should be noted that Form 3a of this test is especially 
suited to such a correlational analysis since, unlike the shorter forms, 
it consists of separate sub-tests, each of which is timed separately. 

TABLE 29 Intcrcorrclations anion^ the Sith-'l c\ts of the AGCT 
in a Random Sample of 5000 Cases 


'iJ.il.i (ViMjiiPcl Ktsi:url' A ('-(), 77) 


T csLs 


3 

4 

1. KcLiding anJ VtKabuIaiy 

.81 

.81 

.71 

2. Aiithmclic Conijiiilalion 


.90 

.73 

3. Arithmetic Keasoiung 



.15 

4. Paltcin Analysis 





Table 30 shows the intcrcorrclations of the parts of the Army Mechan- 
ical Aptitude Test with each other and with total AGCT scores. 

These correlations arc much higher than those found for similar 
tests among college students. But even more conspicuous is the 
relative uniformity of the correlations, regardless of lest content. 
Such uniformity suggests that the relationships could pQ expressed 
in terms of a single general factor. The tetrad criterion, for ex- 
ample, would be readily satisfied when all correlations are nearly 
alike. Fspceially interesting arc the correlations of die three mechan- 


TABLE 30 Intercorrelatioi.s tmioin^ Armv Mechanical Aptitude 

Suh-f ests iifid AGCT iotal Si ore in a Rarilom Sample of 5000 Cases 

( I Kit:i r/'.ni [ ‘c . ''Oiiiii 1 (vt t.i'ili 77) 


J i'Sf^ ? 

3 

4 

1. Mechanical info-mation 67 

.78 

.11 

2. Surface Development 

71 

.76 

3. Meehan !Lai C omprehensi a 


77 

4. AGCT- 3: 'Tolal Seoic 




ical aptitude sub-tests. Tt will be seen in Table 30 that these tests 
correlate .67, .78, and .71 with each other, and they corielatc .77, 
.76, and .77 with AGC'^T total scores. The correlations of the mechan- 
ical tests with the separate sub-tests of the AGCT were nearly as 
high, ranging from .65 to .72. From an examination of such cor- 
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relations alone, it would be impossible to pick the correlations be- 
tween two mechanical tests and those between a mechanical and a 
verbal or numerical test, since they are all so nearly alike. To be sure, 
it is quite probable that all these correlations were spuriously raised 
by inadequate control of testing conditions. Thus if a given individual 
was incapacitated by illness, fatigue, or other physical discomfort, his 
scores on all parts of the AGCT would be lowered by about the same 
amount. If, moreover, the Mechanical Aptitude Test and the AGCT 
were given within a short time of each other, the same disturbing con- 
dition might alfect performance on both tests. Any uncontrolled fac- 
tors in test administration, such as distractions or improperly given 
directions, would likewise tend to raise or lower the scores of a 
particular group on all parts of a test, thus raising the intercorrclatiors 
and making them more uniform. It is doubtful, however, whether such 
spurious factors could account for the major part of the c/btained 
correlations. It seems reasonable to expect that even if such factors 
had been controlled, the intcrcorrelations in Tables 2d and 30 would 
still be much higher than those found among similar tests given to 
college groups. 

What such findings suggest is that adults whose educational level 
is no higher than that of children resemble dnidren much more than 
they do college students in their trait relationships. Among persons 
of lower educational levels, irrespective of age, abilities appear to be 
less highly dilferentiated and the general factor is relatively con- 
spicuous, 

Sev, Some data on sc' dilTcrcnces in trait relationships are also 
available. In an early Hnglish study (72) on mechanical aptitude in 
school children, for example, the b>'ys' seo,rcs on the various spatial 
tests correlated more highly with ca 'h other and less highly with esti- 
mates of "‘general intelligence’’ than did the girls’ scores, d'he author 
suggested that spatial Uwts depend more largely upon a special apti- 
tude among boys, and arc moie largely inlluenccd by “general intelli- 
genee” among girls. In a later study (74) on 1 year-old linglish sciiool 
ciiildren, a factorial analysis identifiv'd a spatial factor among the 
boys but nc't among the girls. This arvestigator likewise eoncludcd 
th.at ^ome of the tests involving a >paec factor for the boys were per- 
formed by the girls “by means of their general intellectual facility.” 

Corroborative data arc furnished by studies on American school 
children at ages and 12 (11, 82). In these groups, the correlations 
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of the spatial tests with verbal and numerical tests were higher among 
girls than among boys. In the same groups, the intercorrelations of 
verbal tests with each other tended to be higher among girls, while 
the intercorrelations of numerical tests with each other were higher 
among boys. These sex dillerences in correlations were larger in the 
12- than in the 9-year-old group, and thus seem to become more con- 
spicuous with age. Several factor pattern analyses of mathematical 
aptitude and of performance in mathematics courses have yielded 
factors which differ in both number and nature for the two sexes 
(15, 78). Studies of memory, conductea with 9-, 12-, and 15-year-old 
children and with college students, suggest that memory operates 
more nearly as an independent trait among women than among men 
(3, 4, 45). The intcrcorrelations of memory tests with each other 
tended to be higher among women; at the same time, the community 
between memory and non-memory tests tended to be greater among 
men. 

Such data suggest that those groups which excel in performance 
within a given area exhibit a more c!osel> knit organization of per- 
formance within that area. In the studies cited, for example, the women 
excelled in average scores on the memory tests and also Jjjiowed higher 
intcrcorrelations among such tests than did the men. Jn mechanical 
aptitude, on the other hand, the men excelled in level of performance 
and showed higher intcrcorrelations among such tests than did the 
women. Moreover, the correlations between spatial and non-spatial 
tests were lower among men than among w\)mcn. It is possible that 
the same conditions which juake for good performance along certain 
lines tend also to unify and crystallize such performance into a dis- 
tinct “trait.” 

Other Croup Dilfcrcnces. Factor patterns among different occu- 
pational groups also offer interesting fields for research. Very little 
information is available in this area, although a few investigations 
offer promising leads. For example, among groups of adult men, the 
intcrcorrelations of three manual dexterity tests were consistently 
higher among operatives in repetitive tasks than among clerks or 
skilled trades workers (9, 96). The average correlations were .41, 
.26, and .25, for operatives, clerks, and trades workers, respectively. 
On the other hand, the dexterity tests correlated lower with spatial 
tests among both operatives and trades workers than among the 
clerks. From such findings, one might speculate regarding the possible 
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role of motor and mechanical experience acquired during the voca- 
tional training and actual job performance of these three groups. A 
difficulty encountered in making comparisons among most vocational 
groups is that such groups generally diiier in educational level, age, 
and other variables. 

Differences in factor patterns among culinral groups ought also to 
be considered, [Relevant data are virtually non-existent in this acea. 
The reason is undoubtedly to be found in llie difficulty of devising a 
battery of tests applicable to widely diverse cultural groups. Although 
attempts have been made to construct tests which are relatively 
‘‘culture-free,” such tests do not offer sufficient variety and breadth 
of content to permit factor pattern analyses of the scope conducted 
within our culture. An analysis of the intercorrelations of scores on 
even such limited tests, however, would be illuminating. It would be 
very surprising indeed if in cultures very unlike our own vve should 
find the emergence with age of the verbal, numerical, spatial, and 
other familiar ‘\aptiiudes'’ of our factor studies. 1'o be sure, a certain 
degree of differentiation into relatively unified behavior traits may 
occur with age in all cultures. But the nature of such traits and the 
degree of differentiation arc piobably most unlike those found in our 
own culture, 

* Finally we may examine briefly the results of factorial studies on 
iufraluunau groups. About a do/en investigations have been reported, 
most of them on while rats, but their rc.sults have all but defied intcr^ 
pretation.’^' In one study, for example, the intercorrclations of per- 
hminance in nine tests wl c sw low that nothing could be concluded 
beyond extreme specificity (7 1). As for the rest, the interpretations 
of the factors identified aie highly .peciilative. Most of these factors 
arc limited to a particular type of si’ nation or learning problem. Some 
have been defined in terms of specific techniques which the animal 
may use in solving moie than one problem, such as the principle of 
turning alternately right and left ( 1 07, 120). One is imprcs.sed, more- 
over, with the frequency with which fac-ors related to emotional 
aspects of behavior appear in conj’sV'tion with “intellcctuar’ factors. 
1 he relatively greater prominence oi such cmc^tional factors is also 
noteworthy. Among the factor descriptions, for example, can be 
found such terms as ‘'a combination of intelligence and tameness,” 


14 , 39 , 46 71 . 79 , 84 , 107 , 117 , 120 , 121 , 122 , 123 . 
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“wildness or panicky behavior,” “wildness-timidity,” and “sclf- 
confidcnce.” 

The diniculty of identifying factors in these animal studies, as well 
as the closer intertwining of ‘"ability” factors wilh “emotional” factors 
than in the human studies, is not surprising when we consider certain 
facts about the subjects’ backgrounds. White rats have not been sub- 
jected to formal education with standard sequences of courses in Ele- 
mentary Maze Running 1-2, Problem Solving 5-6, or Advanced 
Seminar in String Pulling! Unlike the school children or college stu- 
dents of the human factor studies, the animals have not been exposed 
to that classic dichotomy between curricular and extracurricular, be- 
tween standardized intellectual development and unstandardized emo- 
tional development. It has also been suggested that the inclusion of 
individuals from genetically different strains within the same grcuip 
may account for some of the confusion and inconclusiveness of these 
factor analyses of animal behavior (84). Probably the most fruitful 
contribution that animal studies can make to the analysis of trait rela- 
tionships is the experimental investigatitm of Jvnv factor patterns may 
be developed and altered in animals Jiving under controlled laboratory 
conditions. The opportunities provided by this a.pproachiiavc scarcely 
been recognized. 


TRAITS or PrRSONAHTY 

Typical Findings of Factor Analysis. The application of factorial 
techniques to the measurement of personality, although relatively 
recent, has come to represent an active and prolific area of research. 
Two of the best factorial analyses of personality questionnaires are 
those of Mosier (75) and the Guilfords (54, 55, 56, 57). In such 
studies, the initial data are the intercorrclatioiis among individual 
ite/ns\ rather than among test scores.^' The subsequent procedure is 
the same as in factorial analyses of abilities, with the exception that 
negative factor loadings are not usually excluded in the rotation of 
axes. Since many personality traits may be regarded as “bipolar” (e.g., 
ascendance-submission, introversion-extroversion), negative loadings 
are more intelligible in this area than in the factorization of abilities. 

” Since Ihc responses to such ilems are generally twofold tYcs )r No), a type of 
correlation known as trimchofic js employed foj this purpose, in place of the more 
familiar Pearson Product-Moment Correlation. 
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Mosier (75) administered 39 of the most discriminative items of 
the Thurstone Neurotic Inventory to 500 college men. Rather than 
finding emotional instability to be a unitary characteristic, Mosier 
found evidence for eight orthogonal traits in the responses to these 
items. I’he list, with tentative trait designations and illustrative be- 
havior, is as follows: 

C. Cycloid tendency: ups and downs in mood. 

D. Dcpicssion: lonely, frequently in low spirits. 

//. Hypersensitivity: feelings easily hint. 

I. Inferiority: Jack of sclf-confidencc. 

S. Social introversion: shy, keeps in background on social occasions. 

P. Public selj-conscionsness: difficulty in public speaking, stage fright. 

Co. Cognitive defect: personality ditficulties caused by individual find- 
ing himself intellectually below a\erage of group in which he is 
placed. 

An. Autistic tendency: daydreaming, shui-in tendencies. 

The interpretation of the last two factors, Co and An, was much less 
clear and is oHered very tentatively. 

In a scries of investigations by a similai method, the Guilfords 
(54, 55, 56) analyzed the most frequcntl\ recurring items in several 
intiovcrsioii-cxlrovcrsion questionnaires, d his analysis was later ex- 
tended to other types of personality questionnaires (57, 69). In the 
entire series, a total of thirteen factors were identified and described 
as follows: 

5. Social introversion: shy, keeps in background on social occasions. 

A 1 hinLing introversio: . inii\>spcetivc, rcficctivc, meditative disposi 
tion. 

r>. Depression: often “blue,” wori'cs over possible misfortunes. 

C, Cycloid tendency: frequent ship's of rm)od 

R. Rhaihyniia: happv -go-lucky, carefree. 

G. General Activity: tendency to engage in overt activity. 

A. As( cndance-suhniission: Si)cial leadersbip or ilominancc. 

M. Masculinity-femininity: similarity of responses to those typical of 
men or of wximen. 

Thice ciLicst'onnaircs weie conslnictcd ^ iiicasiirc thesr factors: Guilford Tii- 
vcnlory of Faciois SnX'R. Gmllord Mailin Invcntojy Factors GAMIN, and 
GLullord-Marlin Personnel Invenlorv 1 ((), L o. An). The thirteen factors are not 
entirely independent of each othei. In fact, Lovell (h8). coj relating scores on cac!i 
of the ihiilccn factors, carried out a facloi analysis on these factor correlations and 
identified four “super-factors”: Drive-Reslra‘nt, Realism, Emotionality, and Social 
Adaplabdity. 
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I, Inferiority: lack of self-confidence. 

N. Nervousness: irritability, jumpincss. 

O, Objectivity: viewing sell' and surroundings objectively, not taking 

things pcisonally. 

Co, Cooperativeness: accepting things and people as they are, tolerant, 
not fault-finding. 

Ag, Agreeahlencss: not quarrelsome, belligerent, or domineering. 

It will be noted that the tw^o investigations found several traits in com- 
mon. Social introversion, cycloid tendency, and depression were iden- 
tified in both series of studies.^'' Mosier's '‘hypersensitivity,” more- 
over, bears considerable resernbiance to the Guilfords’ ‘'objectivity," 
albeit expressed in terms of the opposite pole. It is interesting to note, 
too, that when both studies arc considered together, the concept of 
introversion seems to break down into at least three aspects, viz., 
social, thinking, and public or ‘'platform” introversion as represented 
by Mosicr’s P-factor. 

Studies such as the above bring out at least two points. First, the 
responses of American college students to personality questionnaires 
tend to be “structured" into a small number of dilTerentiable clusters, 
rather than being cither wholly specific or completely unified and gen- 
eral. Secondly, the labels commonly attached to personally inven- 
tories should be regarded with considerable caution. A single test of 
ncuroticism or intro\ ersion may measure several independent person- 
ality tendencies. Moreover, tests with different labels may measure 
in large part the same factors, as indicated by the overlap of the fac- 
tors reported by Mosier and those reported by Guilford in his initial 
analysis of introversion items. The converse is also likely to be true, 
viz., tests bearing the same label may measure a different combina- 
tion of factors. 

Mention may also be made of a number of factorial studies of 
interest inventories such as the w^ll-known Strong Vocational Inter- 
est Blank (30, 33, 41, 80). “Interest” factors have been identified 
which correspond to certain vocational areas, such as “technical 
science” occupations (e.g., mathematics, chemistry, engineering), 
social service or welfare work, selling, and financial and business 
detail work (e.g., accounting, banking, purchasing). These response 
clusters, moreover, appear to be related to broader aspects of person- 

factor labeled “inferiority,” appearing in both lists, was not ‘ndcpendenlly 
verified, but was taken by Guilford from the Mosier study. 
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ality, such as values (as measured by the Allport-Vernon Study of 
Values) and social adjustment. To find that the organization of per- 
sonality may be significantly related to traditional occupational group- 
ings which have developed within our culture is of considerable inter- 
est in connection with the problem of the origin of traits. 

Some investigators of personality have been engaged in applying 
factor analysis, not to questionnaire responses, but to behavior ratings 
of both children and adults ( 18, 25, 26, 27, 28, 70). I hc most ambi- 
tious of these projects is that conducted by R. B. Cattcll (28). 
Beginning with a list chosen to cover all the personality traits which 
had been named, either in the dictionary or in the psychiatric and 
psychological literature, Cattcll first reduced the list to 171 trait names 
by combining obvious synonyms. The next step was to obtain ratings 
for each of these 171 characteristics on 100 subjects of both sexes, 
all over 25, and varying in occupation from unskilled laborers to 
artists and business and professional people. Jiach subject was rated 
by one person who knew him well, the rating scale containing only 
two categories for each trait, viz., above average and below average. 
By correlating these ratings and grouping together all traits which 
correlated over .45 with each other, 67 clusters were obtained. 
'Through further combination of these gnuips into '‘nuclear clusters,” 
the number was reduced to 35. Ratings i^n 208 men by two independ- 
ent raters w^ere then obtained for these 35 traits. I'he men averaged 
30 years of age and varied widely in occupation. A factor analysis 
of the inlcrcorrelations of these 35 traits, follow^ed by oblique rota- 
ti(m of axes, led to w'hat the author terms “the primary source traits 
of personality.” These traits, an even dozen, are given below (28, 
pp. 475-496). 

( 1) Cyclothymia v.v. schizothymia. 

(2) General mentn! eapacily v.v. mental delect. 

(3) Emotionally mature, stable character vs, demoralized general 
emotionality. 

(4) Dominance and ascendance vs. submissiveness. 

(5) Surgcncy-- v.v. agitated, melancholic desurgency. 

(6) Sensitive, anxious emotionalnv '-.y. rigid, lough poise. 

““'riic dictionary list was based upon the Allport and Odbert compilation (2). 

Actually, correlations were computed separately on 13 gioups of 16 men each 
who had been rated by the same raters. These coi relations were then aveiagcd \ia 
Fisher’s 7-function. 

-^Th.s term refeis to plac’d, realistic cheerfulness and enthusiasm. 



524 Differential Psychology 


(7) Trained, socialized, cultured mind vs. boorishness. 

(8) Positive character integration vs. immature, dependent character. 

(9) Charitable, adventurous cyclothymia vs. obstructive, withdrawn 
schizothymia. 

(10) Neurasthenia v^. vigorous, “obsessional determined” character. 

(11) Hypersensitive, infantile, sthenic emotionality v^. phlegmatic 
frustration tolerance. 

(12) Surgent cyclothymia vs. paranoia. 

vSomc of these traits, such as “general mental capacity” or “a 
trained mind,” overlap with ability variables, but their definition is 
strongly slanted toward emotional and motivational characteristics. 
Thus, for example, the terms “deliberate” and “persevering” arc in- 
cluded in Caltcirs detailed description of “general mental capacity.” 
Cattell has maintained that this list of twelve traits is corroborated 
by the research of other investigators who used not only bcliavior 
ratings but also other methods of trait measurement. vSome of tlie 
resemblances, how'cvcr, arc not too clearly apparent. In view of 
the errors to which ratings arc known to be subject, the crudeness of 
the rating scale employed, and other methodological limitatjpns of the 
present study, much caution must be exercised in generali/ing from it. 
Certainly, data arc needed on other and more clearly delineated pop- 
ulations. Perhaps the only observation which can be conlidently made 
at this time is that the seaidi for personality traits has met more 
obstacles and inconsistencies than that for unitary abilities. 

‘‘Common” i crsit.s ‘Tnclividiial” Trails. Some w riters on persi>nal- 
ity have made a distinction between common and individual trails (ef., 
e.g., 1 ). The former refers to the sort of trait identilied through factor 
analysis and otlicr tecliniqucs based on standardized tests and on the 
evaluation of the individual in terms ol group norms. The latter, or 
individual, trait refers to the sort of trait identified by an analysis of 
the unique experiences of the particular individual. Such a trait, which 
mirrors the individual's idiosyncratic behavior organization, is ob- 
served through clinical piocedures and other intensive, prolonged, 
and relatively qualitative techniques. From one point of view, “type 
concepts” may be regarded as an attempted compromise between the 
two extremes of common and individual traits. Such theories imply 
essentially a pattern of behavior relationships shared by a relatively 
limited group of people — narrower than the groups to which the com- 
mon traits of factor analysis are ascribed, but including more than a 
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single individual. In general, individual traits and type concepts have 
flourished principally among writers on personality, while common 
traits have found more support in the classification of intellectual 
variables. 

It should be remembered that, whether found by factor analysis, 
type studies, or biographical observation of a single individual, a trait 
is always essentially a jHJtrcrn of relationships within the incUvicluahs 
behavior. The so-called common trait, located by studying a group of 
persons rather than a single individual, is simply a generalized de- 
scription of a pattern of behavior relationships shared by a group of 
persons. Why, then, have such comeion trails found more ready 
applicability in the description of intellectual rather than emotional 
and motivational functions? 

The reason is not difficult to find when we consider the greater 
uniformity and standardization of experience in the intellectual than 
in the emotional and motivational sphere (cf. 0, 10). An obvious illus- 
tration of this point is provided by our system of formal education, 
in which the standardized content of instruction is directed principally 
toward intellectual rather than emotional development. Even if the 
schools were to institute a rigidly standardized “personality curricu- 
lum” (a rather depressing thought!), we still would not expect the 
uniformities of organization characteristic of intellectual development, 
since much of the individnaPs emotional development occurs through 
domestic and recreational activities. Not only courses of study, but 
also occu[)atioiis and othci Lradiiional aieas of activity within any one 
cultural setting, tend to crvstallizc and structure intellectual develop- 
ment into relatively uniform patte«as. Such patterns become more 
clearly evident the longer the individual has been exposed to these 
common experiences, dhe increasing difl’crentiation of abilities with 
age and education becomes inLclligiblc in these terms, as do the diffi- 
culties in identifying common traits among animals. 

A fuither relevant point is the objection raised by some writers 
(cf., c.g., 1, 81) that test items ma/ have ‘'private meanings,'" so to 
speak, for each individual. Discussions of this point have sometimes 
led to rather cabalistic and obscurantist criticisms of psychological 
testing. Actually, this objection is simply another way of saying that 
the same response may not have the same diagnostic or prognostic 
significance when made by persons of widely varying experiential 
backgrounds. Since uniformities and standardization of experience in 
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our culture are more common in the intellectual than in the emotional 
aspects of behavior, “personality” tests are more subject to such a 
limitation than are “intelligence” or “aptitude” tests. A further reason 
for the greater uniformity of intellectual patterns of behavior is found 
in the degree to which such behavior has been verbalized, as con- 
trasted to emotional responses, which are more largely unverbalized. 
It may also be relevant to point out that the distinction between intel- 
lectual and emotional aspects of behavior is itself culturally deter- 
mined. 


AN EXPERIMENTAL APPROACH TO TRAIT ORGANIZATION 

Too often the trait investigator has merely asked: "'What is the 
organization of behavior?" or ""What arc the traits into which the 
individual’s behavior repertory groups itself?” rather than asking, 
'"How does behavior become organized?” and "How do psychological 
traits develop?" T he controversies between exponents of “common 
traits” and of “individual trails,” as well as the apparent inconsist- 
encies in the findings of trait research on dilTcrent age, educational, 
or other groups, point up the need for a more direct investigation of 
the mechanism by which traits develop — the way in whivdi the specific 
experiential background of diljerent individuals determines the organ- 
ization of their behavior into more or less unitary and stable traits. 

An exploratory study of this question was conducted by Anastasi 
(6). The principal aim of the investigation was the experimental alter- 
ation of a factor pattern through a brief, relevant, interpolated expe- 
rience. Five tests, including vocabulary, memory span for digits, verbal 
reasoning of the syllogistic type, code rnulliplication, and pattern 
analysis, were administered to 200 sixth grade school children. All 
subjects were then given instruction in the use of special techniques 
or devices which would facilitate performance on the last three tests 
only. In its general nature, this instructiem resembled that received in 
the course of school work, as, for example, in the teaching of arith- 
metic operations, short-cuts of computation, and the like. After a 
lapse of 13 days, parallel forms of all five tests were administered 
under exactly the same conditions as in the initial testing. Since the 
entire experiment was of such short duration, age changes were prob- 
ably negligible and the influence of other, outside conditions rela- 
tively slight. 
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A comparison of the intercorrelations among the five variables in 
the initial and final testing showed practically no change in the corre- 
lation bclw^ecn the two “non-instruction'’ tests, viz., vocabulary and 
memory span. A slight change was found in the correlations between 
the “instruction” and “non-instruction" tests, and a marked change 
in the correlations aimmg the three ‘Instruction" tests. Factor pattern 
analyses revealed marked difiercnces from the initial to the final test- 
ing. An examination of the factor loadings in the five tests belore and 
after the instruction suggested that the changes were such as would 
have been expected from the nature of the iriter]Kfi:ited experience. 

An interesting parallel in an everyday life situation is provided by 
a study on the organization of mathematical ability in English school 
children (76). Wide variations in the correlations among arithmetic, 
algebra, and geometry test scores w'cre found in dilferent school 
classes. 7'hcse variations w^ere shown to be related to such conditions 
as whether or not the three school subjects were taught by the same 
teacher, or whether the teaching methods emphasized similarities of 
technique among these dilferent branches of mathematics. 

Also relevant jire studies on the eficcts of practice upon factor pat- 
terns. Woodrow (127, i2<S), for example, found marked changes in 
the factor loadings of tests following prolonged practice. Nor were 
these changes a matter of greater reliance upon speed or upon general 
ability alter practice, as might have been expected. Specific changes 
in the factorial composition of most of the tests occurred in the course 
t)f practice, with no evideuwe for the increasing role of speed or gen- 
eral ability, nor for the presence ol a general learning factor. 

Such cxpciimcntal a])proachcs E the development of traits open a 
way for exploring the mechanism whereby the trails identified in the 
purely descriptive or cross-sectional studies may have developed. The 
accumulated cfTccls of education, occupation, and other everyday life 
activities upon the organization of behavior may be illuminated by a 
study of the condensed effects of short-ra;igc, experimentally con- 
trolled experiences. 

It has been suggested that in these experiments all that may be 
changed is the work method used by the subject in performing the 
tests.--^ Such an explanation is certainly plausible, but it should be 
used consistently. For example, when the test scores of subjects of 

Cf. Thurstonc, 109, p. 210 For a clear exposition of the role of work methods 
in individual dilicrcnccs, cf. K. H. Seashore, 85. 
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different ages, occupations, or educational levels shov/ diverse factor 
patterns, such differences, too, may be explicable in terms of different 
methods of work. Moreover, any uniformity of factorial organization 
among members of a given population may be partly the result of 
commonly acquired methods of work. Factor pattern analyses show 
only the organization of behavior as it is found in a group of subjects, 
but do not indicate the origin of such organization. 

If we grant that the ‘'traits” identified by factor analysis arc simply 
functional groupings observable within the subject’s behavior, then 
such trails cannot at the same time be conceived as ‘'underlying abili- 
ties” which remain unaffected while the subject’s method of doing a 
task and his objectively observable behavior are profoundly altered. 
Even the common assumption that certain ultimate limits of perform- 
ance arc set by the individual’s sensory, neural, and muscular equip- 
ment must be modified in the light of the possible variety of work 
methods. Changing the method of work may in part overcome some 
of these physical limitations and thus permit the individual to surpass 
his previously established “capacity level.” The whole process of edu- 
cation is, in one sense, a means of changing work mctlTods. 

In summary, it would seem that the relationships among the indi- 
vidual’s scores on a number of tests ut any one time may be described 
in terms of a small number of relatively unitary factors. Under exist- 
ing cultural conditions, a certain degree of uniformity of factor pat- 
terns is found because of general environmental uniformities. Such 
uniformity of factor ])attcrns is greater in the intellectual than in the 
emotional aspects of behavior, and probably reflects the influence of 
traditional educational curricula, vocational classifications, and the 
like. Thus in the }oung school child we find a large general factor 
through all types of activities which are taught in our schools, the 
so-called higher mental processes. As the child grows older and spe- 
cialization of function is encouraged, certain culturally determined 
differentiations appear. “Group factors” are produced in linguistic, 
mathematical, mechanical, and possibly other functions. These factors, 
however, arc only a mathematical statement or conceptual simplifica- 
tion of the observed relations among concrete responses. And as such 
they may be expected to shift from time to time in tl c same subjects 
or from one population to another because of varying experiences. 
Such terms as “primary abilities” or “basic traits” are likely to be 
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quite misleading. They may cause us to forget the real nature of 
factors. 
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In part II WE suRVEvrD some of the mnjor findings on individual 
differences and attempted to unravel the factors and conditions which 
produce variation from one person to another. With this background, 
wc may now turn to an examination of certain ^^loups into which indi- 
viduals arc commonly classified. Such groupings have been built up 
through social and ..iiltural traditions and iliusirate the general tend- 
ency to employ rigid categories and sharp divisions. Thus individuals 
are popularly classed into the normal and the abnorma.L the genius, 
the feebleminded, the insane, the neurotic. Psychological dill'erenccs 
aro expected, or at least sought, between the se\es or among nations 
or ''races.” Many other groupings can likewise be eor.strued. A person 
can be classified, for example, in regard to religion, political allihation, 
social status, oi e\en place of residence. Psychological dilTercnces 
might be expected between urban and rural popniatit>ns, or between 
groups inhabiting regions of dillerent geographical characier, such as 
mountainous or Hat, inland or coastal, cold or warm. 

These various griuiping , like all rigid elassifieaiions of individuals, 
arc arbitrary and artificial. In all behavioral traits, people arc distrib- 
uted along a c<mtmuous scale and i mnot be assigned to distinct cate- 
gories. When the distributions of .my two biologically or culturally 
diirercntiatcd groups, such as the sexes or "racial” and national 
groups, arc compared, the overlappini^ is so large as to render any 
dilTcrencc I'ctwecn averages of doubtful practical significance. In 
such compariMjns. the ditl’erence between t!ic av erages is far smaller 
than the range of dillerence within group. In dw stiuJy of indi- 

viduals, Ihe (niiy pruuer unit is the individucd. There is no short-cut 
to the understanding of people, no possibility of learning the be- 
havioral peculiarities of a few hro;»d groups into which any individual 
could then be ccnvenicntly pigeon-holed. 
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The multiple and complex determination of the individual’s be- 
havioral development should in itself make us skeptical regarding any 
simple systems of characterizing people. Yet it is an all too common 
practice to expect an individual to be dependable, or shiftless, or dull, 
or excitable, or poor in mechanics, or to ascribe to him dozens of 
similar characteristics, simply from the knowledge that such a person 
is a man or a woman, or that he belongs to a particular “race” or 
nation. 

It is partly to clarify these muddled popular notions that the em- 
pirical study of group differences ought to be undertaken. To be sure, 
a careful examination of the principles underlying individual variation 
in general should suffice to show ihe fallacies inherent in many popu- 
lar claims regarding group differences. But when beliefs arc as deep- 
rooted and emotionally tinged as those governing many group rela- 
tions, they arc not easily dislodged. Direct evidence on the nature of 
group differences is more convincing than deductions from generally 
established principles. 

From a more theoretical point of view, the analysis of group differ- 
ences is a valuable adjunct to the investigation of individual differences 
in general. The existence of culturally diverse groups may be re- 
garded as furnishing a natural experiment in the production of human 
variability. If psychological x:iiffercnces among groups are investigated 
with reference to the factors which brought them about, the under- 
standing of individual differences will have been considerably 
furthered. 

In the present analysis of group differences, we shall be more con- 
cerned with fundamental concepts and methodological issues than 
with a cataloguing of results. 'Ihe latter have little meaning unless 
critically evaluated. The data on group differences are difficult to 
interpret and have frequently led to opposite conclusions in the hands 
of different writers. It is of fundamental importance, therefore, that 
the special difficulties inherent in group comparisons be clearly real- 
ized and that the necessary cautions and controls be applied before 
making any generalizations. With a clear understanding of the prob- 
lem, the reader will be in a position to make his own interpretation of 
any data which he may come across. And he will also bo able to guard 
against hasty generalization and to detect the fallacies in erroneous 
statements with which he may be confronted. Objective and critical 
habits of thinking are more urgently needed in the field of group dif- 
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physiology during the Renaissance. The medical conception of mental 
abnormalities is still prevalent at present, especially among psy- 
chiatrists. 

Jhe psychological study of the abnormal is of relatively recent date. 
Its approach to the problem is through a direct study of behavior. 
In some cases, behavioral disorders may have structural concomi'uints, 
such as physical diseases, lesions, and malformations. But in the ma- 
jority of cases no such physical bj->is has been discovered and it would 
only obscure the issue to attribute the behavioral manifestations to 
unknown organic causes. Analysis of the behavioral history and 
environmental background of the individual, on the other hand, often 
reveals an adequate explanation for the development of the particular 
symptoms. Behavior disorders aic the special domain of the psycholo- 
gist and can be stu lied directly in terms of behavior principles, with- 
out vague, hypothetical reference to some other realm or class of 
phenomena. Jt should also be noted that abnormality is specific. The 
indixidual may be quite abnormal in one trait and yet remain close 
to the norm in other respects. This true of both intellectual and 
emotional traits, and lollows directly from the organization of be- 
havior traits (cf. Chs. 14 and 15). 

Abnormal p^iycholoi,^ is an empirical and direct study of behavior 
deviations. As such it may be regarded as a subdivision of differential 
psychology. A distinction is now made between feeblemindedness, or 
intellectual deficiency, and personality disorders. Ihesc two categories 
of behavior deviations wi! be considered in the sections which follow. 

1 FIBLEMINDEDNESS 

Definition and Levels. Feeblemindedness represents the lower end 
of the distribution of intelligence. It is characterized by intellectual 
rather than emotional or peisonality defect. 1 he term “feebleminded- 
ness’' is not used, however, to cover dcficicnf y in ativ ability. Thus an 
individual may be far below averagr hi music, drawing, or mechanical 
aptitude, and >till be regarded as intcliectually normal. eeblcwinded- 
ness** designates a deficientx onh in those abilities wliuh have proved 
essential for survival iu our cultural milieu. 

As was indicated in the preceding cliapter, verbal ability probably 
plays the dominant role in our conception of feeblemindedness. Lin- 
guistic deficiency has often been explicitly accepted as a criterion of 
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mental deficiency. Thus Binet and Simon (12) wrote; “An idiot is a 
person who is not able to communicate with his fellows by means of 
language. He docs not talk at all and docs not understand.” Similarly, 
Esquirol (cf. 38, p. 165) distinguished between three levels of feeble- 
minded persons: (a) those making cries only; (b) those using mono- 
syllables; (c) those usmg short phrases but not elaborate speech. 
Another classification which is still widely quoted (cf. 38, pp. 165- 
166) is that which distinguishes between: (a) idiots, who arc incapa- 
ble of spoken language, and are limited to gestures; (b) imbeciles, 
who are able to understand and employ st>oken language; and (c) 
morons, who are also capable of acquiring wTitten language, but have 
dilTicuity with the more complex verbal and abstract concepts. 

Feeblemindedness has been described from many points of view. 
Probably the most common definitions arc the sociological, or eco- 
nomic, and the psychometric. A widely quoted schema of classifica- 
tion, adopted in 1908 by the British Royal Commission on the 
Feebleminded (14) and still found currently useful (cf. 26, 86 1 , 
illustrates the sociological conception. This classification recognizes 
three grades of feeblemindedness, characterized as follows: 

1. Idiot (low-grade amentia) — “A person so deeply doleetive from 
birth or from an early age that he is unable to guard himself against 
common physical dangers.” 

2. imbecile (middle-grade arnentia) — “One who, ])y reason of mental 
defect existing irom birth or from an early age, is incapable of 
earning bis owm living, but is capable of guarding hirnsell against 
common physical chmgers,” 

3. Moion (high-grade amentia) — “One who is capable of earning a 
living under favorable circumstances, but who is incapable, from 
mental defect existing Irom birth or Irom an eaily age, (a) of com- 
peting on equal terms with his normal fellows, (b) of managing bis 
affairs and himself with ordinary prudence.” 

The psychometric classification is more common among mental 
testers and permits more quantitative definition. When applied only 
to adults, the difTerentiation is often made on the basis of mental age. 
Thus an adult whose mental age is three years or less is usually 
regarded as an idiot; between three and seven is the imbecile level; 

^ hi England, the icrni “feebleminded” is reserved for this level of mental de- 
ficiency, and “amentia” is used as a general tcim to cover all clegijcs of mental 
deficiency. The icim “moion” has been substituted for “feebleminded” in the above 
definition, in accordance with the more familiar American usage. 
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morons fall above a mental age of seven but fail to reach the normal 
adult level. To make the classification applicable to children as well 
as adults, the limits have been expressed in terms of IQ. Terman’s 
classification (83, p. 79) is probably the most widely employed and 
has been reproduced below. 


Ct//r;v//v IQ 

Dullness, rarely classifiahlc as fecbleminilcclnoss 80-90 

Borderline defieicnLy, sometimes classifiable as dull- 
ness, often as feeblemindedness 70-80 

Moron 50 70 

Imbecile 20-50 

Idiot below 20 


Tt should be borne in mind that these distinciions arc purely arbi- 
trary and arc made only for practical convenience. I here is no sharp 
dividing line either between the normal and the reehleminded or be- 
tween the various ‘“levels” of feeblemindedness. 1 he intellectual 
dilTercnces arc of degree only and form a continuous gradation, 
although the social elTeets may dilTer qualitatively. The diagnosis of 
feeblemindedness, moreover, should never be based solely upon an 
10. The feebleminded individual has been described as subnormal in 
‘'personal dependence, scIf-direction, social responsibility, and self- 
support” (26, p. 867). A useful adjunct to intelligence tests in arriv- 
ing at a practical classification of feeblemindedness is the Vineland 
Scu’ial Maturity Scale (24), which measures the individuafs “social 
age” from 0 to 25 years. This scale is a means of evaluating the indi- 
vidual’s everyday life behavior in terms of age norms. 1 he subject’s 
emotional balance, health, physique, special skills, and environmental 
milieu all contribute to the adequacy with which he can cope with 
everyday life problems; thus they indirectly inlluencc the diagnosis of 
feeblemindedness in individual cases. 

Estimates of the percentage of feebleminded persons in the general 
population range widely, but the most reliable and comprehensive 
investigations report frequencies falling between one and two per cent 
(26). The specific per cents found in different surveys vary with the 
criterion of feeblemindedness employed — whether tests, social com- 
petence, or a combination of the two — as well as with the point at 

^This chissificMtion was oiii/inallv based unnn the UUfi revision of the Stanfoid- 
Bind. .Subsequent conipaiisons (64, 71), however, have showai close ajiiecmcnt ni the 
results obtained with the revision at the lowci IQ level.., and the above classifi- 

cation is still considered satisfactory as a rough practical guide. 
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which the dividing line is set. Geographical locale also makes a con- 
siderable difference, the incidence being much greater in some areas 
ihan in others. Age, too, will affect the estimate, since the relatively 
short life expectancy of the feebleminded tends to make the proportion 
of feebleminded in the population appear greater when only children 
arc included in the survey than when all ages are covered. Similar 
difficulties are encountered in the attempt to determine the relative 
frequency of different levels of feeblemindedness. Among institu- 
tionalized cases, approximately I09r nre i liots, 30% imbeciles, and 
60% morons, but the proportion of higher-level cases outside of 
institutions is probably larger, since such cases arc more likely to 
shift tor themselves or to be cared for at home (26). 

Varieties and Contributing Factors. The feebleminded have also 
been classified with respect to variety or clinical type, on the basis of 
differentiating ph>sical conditions.^' Among the most familiar of such 
clinical types is Mongolism, named from the oblique, slit-like eyes 
which produce a superficial resemblance to the Mongolian face. 1his 
type can readily be identified by a number of other physical character- 
istics, such as small, round head; smooth, moist, puffy skin; fissured 
tongue; and short, stubby fingers. I his is one of the mosf frequent 
clinical types, constituting from 5% to 10%, of the population of most 
feebleminded institutions. Among the pi’jssible causes oi Mongolism 
suggested by different invcstigsitions are nutritional, toxic, and endo- 
crine distiubances during utciinc life (6, 40). Age ol the niotlvT 
seems to be a factor, tlie propiKtion of Mongolians born to mothers 
over 40 being much gre ater than the proportion born to younger 
mothers (34, p. 117), 

About equal in ficqucncy to Mongolism is tlie t\pe of feeblemind- 
edness traceable to inirdciankil hirtli lesions. As generally used, this 
category covers not only injuries sustained through instrumental or 
difficult delivery, but also such conditions as neonatal asphyxia, pre- 
mature birth, and infectious or toxic factors operating before birth. 
There has been a growing conviction that many otherwise undifferen- 
tiated cases of Icebleniindedness may Jiave originated in this fashion 
(7, 23, 25, 75, 77). Motor disorders of varying degree of sc\crity 
may be present, as in the familiar “spastic” cases. It is entirely pos- 
sible, however, for the motor symptoms to develop in an individual 

detailed suivey of cliniLal varieties, their chaiiiciciistits, and siipgesteJ causal 
theories can be found in wSheiman (76) and liedgold (86). 
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of normal or superior intelligence." What is more important for our 
present discussion is that the intellectual defect may occur in an indi- 
vidual without the motor symptoms. In such cases, the cranial injury 
is not likely to be suspected without an examination into the birth 
records. The particular combination of symptoms which develops is 
probably related to the extent and location of the cerebral injury. 

Other clinical types of feeblemindedness are relatively infrequent, 
occurring in less than one per cent of the institutionalized population. 
The microcephalic has an abnormally small, pointed skull, with a 
characteristic "'sugar-loaf’ appearance. Tlic hydrocephalic has a very 
large skull and an excessive accumulation of cerebrospinal fluid in the 
brain. Ihc cretin is easily identilicd by his stunted physique, coarse 
thick skin, loss of hair, and other physical characteristics. Thyroid 
deficiency has been clearly identified as a major factor in cretinism. 
The administration of thyroid extract, if begun early in life, usually 
effects a considerable improvement in both physical and intellectual 
condition, altliough some cases do not respond to this therapy. The 
causes of microcephaly and hydrocephaly are not so w^ell established, 
but there is evidence suggesting the role of prenatal factors, including 
maternal nutrition, toxins, infections, and radiation (72; 76, 
pp. 145-154). 

A relatively rare but clearly identifiable clinical type is phenyl- 
pyruvic amentia (33, 42, 50, 69). These cases arc differentiated by 
the presence of plienylpyruvic acid in the urine, resulting from a 
hereditary metabolic disorder. The condition appears to depend upon 
a single recessive gene (7^M, and has never been found in a person of 
normal intelligence (43). It is usually accompanied by motor symp- 
toms and is found in association with a severe grade of mental 
deficiency. 

Mention may also be made of the suggested role of the Rh factor 
in mental deficiency. This is one of the factors determining the “blood 
groups,” w'hich have become familiar to general public principally 
in connection with blood transfusions and v/ith the determination of 
paternity. 1 he Rh factor, discovered in 1939, is peculiar in that it has 
no natural antibody in human blood, but it may provoke the produc- 
tion of antibodies when introduced into the blood of persons lacking 
in this factor (rh negatives). Jt has been estimated that about 15% 

" rf., c p., thr inteiestmg: biographies of intellectually superior peisons with 
ceicbral bath injuries, such as: lloopcs. G. G. Out of the Running. Springfield- 
Thomas, 1939> Pp. 158. Carlson, E. R. Born Way N. Y.: Day, 1941. i*p. 1?-*.. 
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of the population arc rh negative and therefore susceptible to the 
production of such antibodies. Some important implications of this 
situation for fetal development have been discovered. First, it should 
be noted that a certain amount of blood transfusion occurs between 
mother and child during uterine life. Jf the mother is rh negative and 
the child Rh positive, antibodies will be formed in the mother’s blood 
as a result of such a pregnancy. The first-born is not usually affected 
by this condition, since it takes time for the mother to develop the 
antibodies. Subsequent offspring, however, if again Rh positive, may 
develop a severe physical condition which generally proves fatal 
before birth (29, 58, 78). 

It has been sugg.esled that in those cases where Rh incompatibility 
of mother and child does not result in any observable physical dis- 
orders, the effect upon the fetal blood may still be snllicient to inter- 
fere with proper brain development and thus indirectly lead to 
feeblemindedness. Studies of the blood groups c'f feebleminded chil- 
dren and their m('»thers have shown that, in a certain percentage of 
cases not classifiable into any of the known clinical types, the mental 
deficiency may have resulted from such Rh ijjmuinization (20, 21, 
79, 86, 90). d he per cent of Rh positive children witli negative 
mothers in such groups significantly cxceetled chance eK[)ectatiim. 
Subsequent results ln\e tended to qualify these conclusions and to 
indicate that the operation of the Rh factor in feeblemindedness is 
probably much less frequent than was suggested by some of the early 
findings (74, 81 ). dhe hypotliesis, however, remains a plausible one, 
at least i'or a small number of ca^es, and leseareh akmg these lines is 
being actively cairied forw'ard. 

d'hosc cases not falling into any of the above clinical types arc 
v^lassified as nfKliflcrcniiafcd mental (lefieicfKW d'iiis is by far the 
largest category, including from onc-third to iwo-lhirds of all insiitn- 
tionaliz^d cases (26 K The total proportion is probably much greater 
because such cases, being noima! in appearance, arc Ic>s likely to 
attract attention and be institutionalized. Other designations for this 
category are '‘familial ty];e" and "primary” or "endogenous'’ mental 
deficiency, as contrasted to "secondary” or "ex(>genous.” 'These terms 
arc misleading in their implication that imditTcrcntiatcd mental 
deficiency is necessarily hereditary. 'Fhcre is no more reason for asso- 
ciating heredity wath undifl’erentiated mental deficiency than for 
associating it with the other clinical varieties discussed above. All we 
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can say positively about undifferentiated mental deficiency is that no 
physical basis has yet been discovered for it. In some of these cases, 
specific physical factors may eventually be identified, as illustrated by 
the recent findings on the Rh factor and on intracranial injuries. For 
the rest, it is possible that the feeblemindedness is not associated with 
any structural deficiency but only with experiential factors. 

The fact that undifferentiated menial deficiency tends to rim in 
families (hence the designation "‘familial type'’) may, of course, be 
interpreted as evidence for environment just as well as for heredity. 
In this connection, it is interesting to note that the few studies which 
report improvement in intellectual level as a result of special training 
have found that it is the undiffercntialed type that responds most 
readily to such training (48, 49). It is also relevant to observe that 
undifferentiated aments more often come from homes of lower socio- 
economic level, while the specilie clinical types show a more random 
distribution of home background (13, 35). The latter occur with 
greater frequency than the former in families which are normal or 
superior in intellectual and socio-cconornie level. Inferior home 
environment may be a factor in the intellectual retardation of at least 
some of the cases in the undifferentiated group. 

Certainly the term "‘undifferentiated’' is more precisely descriptive 
of our knowledge regarding this type of mental deliciency than are the 
other suggested designations — “unknown” would probably be a more 
candid characterization. To assume a hereditary basis for just those 
cases in which no structural deliciencies have yet been demonstrated 
seems to suggest that fecidemmdedness is itself a chemical substance 
which can be transmitted by the genes! Unless some structural defi- 
ciency is demonstrated, whal is tl\'re for these cases to inherit? The 
evidence for hereditary contributions seems, in fact, to be much clearer 
in the case of the so-called secondary forms of feeblemindedness. 
Glandular and metabolic conditions of the mother, blood groupings, 
and even maternal body formation which might increase the chances 
of a difficult birth undoubtedly have a hereditary basis. From one 
point of view, of course, it may be i rgued that the feeblemindedness 
in such cases is only an indirect result of the hereditary condition. But 
this only serves to point up the artificiality of the heredity-environ- 
ment distinction, especially as applied to behavior. Of more practical 
significance is the distinction proposed in Chapter 4 between struc- 
turally and functionally determined conditions. The specific clinical 
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types of feeblemindedness discussed above are all structurally deter- 
mined and as such would be relatively uninfluenced by training. In 
such cases, the structural deficiency interferes with the acquisition of 
normal behavior. Many of the “undiftcrentiatcd” cases, on the other 
hand, may be functionally determined and therefore much more 
responsive to training. 

Intellectual and Physical Status. It has been repeatedly demon- 
strated that the feebleminded arc not equally deficient in all functions 
and that the degree of their inleriority increases as we go Irom simple 
sensory and motor tasks to complex intellectual processes, especially 
those dealing with symbols (17, 3H, 62, 66). In a pu)neer study, 
Norsworthy (f^6) administered a series ol tests, including comparison 
of weights, cancellation, memory, and word association, to I'i? insti- 
tiitionali/cd defectives. In the comparison of weights, 2S9<r of the 
feebleminded reached or excelled the performance ol the lowest quar- 
ter of noimul subjects.'^ In the cancellation tests, the per cent of feeble- 
minded reaching or exceeding the lowest quartiie of the normal gioup 
ranged from 14 to 18, in the memoiy tests from 18 t ) 19, and in two 
ol the association tests Iroin 16 to 17. In the two remaining as‘^ocia- 
tion tests, which involved the naming of opposites, onh alifiut ITr 
the feebleminded rcaehed or exceeded the lowest quaitile of the 
normal These findings were conoborated by Memll ((;2) m a com- 
paiison of the performance of mentally deficient and normal children 
on the separate tests of the Stanlord-Binet. \ he achievement of mental 
defectives in different school subjects shows a similar hierarchy (17, 
62). The feebleminded as a group are most deficient in verbal '‘Ub- 
jeets, such as reading comprehension, less deficient in arithmetic 
computation, and closest to the norm in drawing and shop work. 

I his does not mean, ht'wcver, that such a hierarchy of deficiency 
necessarily exists within the individual feebleminded person. The same 
result might follow it there were more feebleminded peisons deficient 
in verbal ability, fewer deficient in arithmetic ability, and fewest in 
mechanical or sen son-motor aptitudes. The relationship between such 
group averages depends not only upon the relative amount of inferi- 
ority displayed by each individual, but also upon the niimher of per- 
sons who arc inferior. Since verbal aptitude plays such a large part in 
the criterion of feeblemindedness, almost all persons in a fecble- 

** If the two gjoups were equal, V'SS of the feebleminded group would reach oi 
exceed the lowest quarter of the normal group 
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minded group will be definitely below normal in this trait. This con- 
sistent inferiority will of course produce a very low group average in 
verbal traits. 

In numerierd aptitude, many will be bclow^ average, some may be 
normal, and a few even superior. The slight positive correlation be- 
tween performance on numerical and verbal tests, as well as the fact 
that numerical tests are [requently included in scales of “general 
intelligence," would lead us to expect the mahmty, but not all, feeble- 
minded persons to be below the norms in numerical aptitude. This 
would result in a gioup average higher than that in verbal traits, but 
still considerably below normal. 

In tasks involving sensori-motor skills or aptitude in mechanics, 
music, or pictorial art, we should expect the feebleminded distribu- 
tion to approximate even more closely that oi a normal group, since 
these trails shov\ vc:y low correlations with verbal ability or intelli- 
gence test performance. Ihe majority ol the teeblemindcd would be 
nearly normal in these functions, only a small number inferior, and 
a lew superior. As a result, tlic status of the group iis a whole would 
be only slightly belovv normal. 1 hus it is apparent that the hierarchy 
of deficiency usually found in feebleminded groups may result from 
the culturally imposed crUerion ol Jeeblcmindedness and from the 
organization of abilities. 

Many observers have called attention to the ri^idUy and stereiftypy 
characteristic of the behavior ot mental defectives/^ Institutionalized 
morons, for example, will olten carry out routine tasks with unswerv- 
ing precisicm and with no .ign^ of boredom. Such persons seem well 
quaiilied for monotonous, repetitive tasks. It is, ol course, well known 
that monotony is a funelion of the nature of the task, the distracting 
stimuli, and the charaelenstks of the worker. I he individual with 
relatively lew competing interests and limited ability will find a repeti- 
tive task more congenial and satisfying than a task involving many 
shifts and readjustments. Despite the evidence of “rigidity” in the 
behavior of many feebleminded persons, it should be noted that the 
feebleminded are capable ol considt’'.iblc impwvcnicni throu^^h learn- 
ing. Thus in an experiment with simple sensori motor and percepliial 
tasks, feebleminded adolescents improved with practice about as 


Kouniii (53), for ex.implc, icporls sonu siij.^pcstivc data obtained with tests of 
r!g.dil> on oidci and youn'uw lethlein'nded persons and (.n normal ch idien, all 
groups oeing equated in mental age. 
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rapidly as normal children of the same mental age (89). Such a find- 
ing suggests that all but the lowest-level cases can certainly profit from 
training, if the tasks and methods of instruction are suited to their 
mental age level. 

In general health, siisceptihilily to disease, and physical develop- 
ment, the institutionalized feebleminded as a group arc below normal. 
Data on this question have already been discussed in Chapler 12. It 
will be recalled that such comparisons must be accepted with caution 
because of the inclusion of piiysically defective clinical types, the low 
socio-economic background of most cases, and the selective factors 
in instilutionalizalion. k is certainly not difficult lo find individuals 
among the higher grade undilTerentiated mental defectives who are 
sturdy, healthy, and good-looking by normal standards. 

Oiiflook for Social Adjustment. Idiots and most imbeciles obvi- 
ously require either institutional or home care. Within the much larger 
group at the moron level, however, a considerable proportion of indi- 
viduals arc their own.” It is this group that has been a source of 
concern as a potentially serious social problem. Recent follow-ups 
have shown that, with a certain minimum of training and supervision, 
the outlook for such mental defectives is more favorabk than was 
formerly supposed (76). Persons who have acquired good work 
habits and mastered a simple skill in a feebleminded institution have 
sometimes succeeded in their Jobs as well as or better than normal per- 
sons with a long work history. 

This should not be surprising when we consider the number of jobs 
in our society which do not demand a high intellectual level. C)nc sur- 
vey of over 2000 jobs, conducted with special reference to the voca- 
tional guidance of the mentally defective, revealed 19 types of 
occupations for w^hich a minimum menial age of onl\ 6 years was 
required ( 16). A number of other kinds of work could be successfully 
performed by persons with mental ages of 7 to 1 1 (16). In another, 
more extensive survey of 18 industries providing 2216 specific occu- 
pations, 47.1% of the jobs required no education beyond the ability 
to speak, read, and write simple English (5). A total of 67% 
called for no education beyond elementary school graduation. Even 
in these cases, it is doubtful whether some of the more abstract con- 

e.g., Ihc drainal'/cci picsentalion of this problem in F R. vv'embridge, I-iy- 
Amonii the Low-Brows Boston: Houghton Mifflin, 1931. Pp. 310. 
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tent of the elementary school curriculum, which might be beyond 
the grasp of the average moron, was really necessary for job success. 

Among individuals paroled or disi har^ed from feebleminded in- 
stitutions, the proportion who adjust satisfactorily is undoubtedly 
large enough to justify such a parole practice. Specific estimates are 
difficult to summarize because of varying standards of successful 
adjustment applied by dilTcrcnt investigators. The estimates of suc- 
cessful social adjustment, without reference to earning capacity, 
range from about 50% to 72% of those paroled (22, 68, 82). The 
proportion is much smaller when adequate vocational adjustment is 
considered. One estimate reached from a consideration of several 
available surveys (76) sets the proportion o1 those adjusting ade- 
quately on an economic and social basis without supervision at only 
about 5%). An addilioniil 20% can be sell-supporting with super- 
vision; much larger percentages can do some productive wi?rk but 
not to the extent of being self-supporting; and still others, although 
unable to hold any job, can remain with their families without creat- 
ing social problems. These estimates arc rather conservative, and it 
should be remembered that they are based on surveys which were 
conducted largely during a period of economic depression, when jobs 
were hard to find. In contrast, it is interesting to note tlie findings of 
a survey of 177 young people paroled in 1941-42 Irom a training 
school for mental defectives (37). Within this group, 88%, were 
employed, many above the level ol unskilled Libor. Most were earn- 
ing from $40 to $60 a week, liad held their jobs over three months 
at the time of the survey, ai;d had secured them without the assistance 
of family, friends, or social agencies. 

Follow^-ups of children who havv been diagnosed as mentally de- 
fective and placed in special classes in the public ^eluK^l iyslem show, 
in general, a larger proportion of successful adjustment. \ his is to be 
expected, not only because such groups are likely to include children 
of higher intellectual levels, but also because commitment to an 
institution is often an indication of poor social adjustment coupled 
wath mental defect. In one foIlow-;'Ti (27), 122 out of 166 Balti- 
more school children who had been diagnosed as mentally defective 

A lotal of 211 inclividujls had been paroled, of whom h had been lednsritu- 
lionalizcd or were described as iinempiovable ai the time ol the siiivcy, and 28 could 
not be located The picsciU analysis was based on the remaining P7 cases. 
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were located 17 years later. Over 75% of this group had never re- 
quired any support from a social agency. The large majority had had 
no brushes with the law, although the number with court records 
was about three times as large as that in a normal group from the 
same school district. In interpreting the latter finding, it should be 
remembered that a large number of mental defectives come from 
squalid, depressing, and unhappy homes, surroundings which fre- 
quently lead even the intellectually normal to jail. 

In a later and more thorough investigation (2), a similar group of 
“opportunity class” children in Nebraska were investigated when all 
were between the ages of 21 and 34. Of the original group of 206, 
196 were located and compared with a high-normal group whose 
IQ’s ranged from 100 to 120. Slightly less than 1% of the original 
mentally defective group were in institutions for the feebleminded 
at the time of the follow-up. Educationally, the subnormal group had 
completed an average of 4 or 5 grades, in contrast to the 12- to 
13-grade average of the “normal,” control group. As in the other 
survey, the majority had no court records, although the proportion 
with such records exceeded that in the control group: v..v. 4% 

for juvenile court, and 18% v.v. 6% for police courff The pro- 
portion of the mental defectives who had held relatively permanent 
jobs was 39%, as compared with a proportion in excess of for 
the normal, control group. rAmong the subnormal, however, 83% 
had been partially self-supporting for varying periods of time. The 
proportion of girls who had married was about equal in the two 
groups, although the subnormal girls tended to marry earlier and 
have more children. Among the boys, the percentage who had mar- 
ried was much smaller for the subnormal, probably because of eco- 
nomic reasons. In a later follow-up on an intermediate, ‘"dull” group, 
more delinquency and poorer social adjustment were found than in 
the mentally defective group (3). 

Several additional points should be considered in evaluating the 
findings of such surveys. First, a large number of these “special class” 
children come from inferior homes in which the parents arc unable 
to provide adequate direction. With proper supervision, many more 
cases could probably adjust socially as well as vocationally. Sec- 
ondly, job turnover is very common among sucli groups and is taken 
as an index of poor vocational adjustment. In at least some of the 
cases, such turnover could probably be avoided by better vocational 
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counseling. Thirdly, the fact that the more recent surveys tend to 
show more favorable outcomes may result in part from the improved 
training and guidance facilities for such groups, both in institutions 
and in the school system. In this connection, mention may also be 
made of the suggestive findings reported by Schmidt (73) in the 
case of a specially designed educational program (cf. Ch. 8). 

PERSONALITY DISORDERS 

Psychoses. “Insanity,” more technically known as “psychosis,” 
represents a pronounced maladaptive personality deviation. In such a 
condition, the individual, although often intellectually normal or even 
superior, is unable to make a satisfactory adjustment because of seri- 
ous personality disorders. Thus he may have delusions of pvTseculion 
which make him sii .pect all with whom he comes into coutacl of 
plotting to poison him, or delusions of grandeur in which he believes 
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Fig. 85. Delusional Animal Drawn by a Paranoid Patient. (From the 
authors' collection.) 
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himself to be Napoleon or some other favorite character. Such symp- 
toms are described as paranoid. Drawings by two paranoid patients 
are reproduced in Figures 85 and 86. The first of these shows the 
“Orange Suicide Yaw,” an imaginary animal which the patient be- 
lieved to be lodged in his stomach and to which he attributed all his 
difiiculties. The animal is entirely black, with the exception of a 
bright orange tongue. Figure 86 is a typical astronomical drawing by 
a patient who had the delusion that he alone had received secret 
knowledge which explained the motion of the earth.^- 

The individual may withdraw so far into his own fantasy-life that 
he loses all contact with his fellow-beings and w'ith occurrences about 
him, as in schiz.ophrcwa. Also characteristic of schizophrenics are 
such symptoms as hallucinations; disorganization of thinking and 
doing; strange, bizarre activity; and odd, stereotyped mannerisms and 
posturing. It has been estimated that schizophrenics constitute nearly 
half of the total resident mental hospital population (56, p. 43). 
The incidence in the general population has been placed at approxi- 
mately 0.85%.. Another frequent category, occurring in about 0.44% 
of the general population, is the mank-depressive group of psychoses, 
characterized hy recurrent periods of extreme depression and cxcite- 
nietit or overactivity. 

vSiich psychotic conditions are ik'l to be confused with feeble- 
mindedness. Psychotics are recruited from all intellectual levels, the 
majority falling within tlie normal range of intelligence. Instances are 
not unknown among the intellectually gifted. Psychotic conditions may 
likewise occur among feebleminded poisons, although for senue psy- 
clu^scs a certain niininuim complexity of intellectual development 
seems to be required. Certain psychoses, such as scinzoplirenla. often 
lead to intellectual detcrioratiem, hut there arc others in wdfich the 
patient may sufl'er no impairment of abilities. 

As in the case of feeblemindedness, there is no sharp dividing line 
between “insanity” and normality (63). Distinctions arc made for 
practical purposes of confinement, trealnjcnt, and similar reasons, 
but close examination reveals a continuous, unbroken gradation from 
the thoroughly well-adjusted person to the conspicuously insane. 

i-roi a brief icpou of the fcse.ir,.h ptojeer m which ihcsc drawings weie col- 
lected, cf , 1. 

For a classification and descrip<ion of psychotic disorders, cf. any lecent stand- 
ard textbook on abnonnLil psychology, such as 18. J9, 5 (). 63. 



The Subnormal 


559 



Fig, 86. “Astronomical Diagram’' by a Paranoid Patient. (From the authors’ collection.) 
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Differential Psychology 


Psychotic symptoms differ in degree from the behavioral peculiarities 
of the normal individual. From the blissful optimist who trusts im- 
plicitly whomever he meets, to the paranoiac who believes that the 
stranger who accidentally brushes against him is plotting his demise, 
there are all degrees of “suspiciousness.” The same may be said of 
other characteristics of the insane. A good example of this is the 
familiar case of the student who, upon reading a manual of psychiatry 
or attending a course in abnormal psychology, believes himself to be 
afflicted with each form of psychosis in turn. Most of us can discover 
in ourselves at least one characteristic of many types of insanity, in 
mild form. It is not normal, in the statistical sense, to be entirely free 
from all such slight peculiarities. 

An important distinction from the viewpoint of the psychologist is 
that between organic and junctional disorders. Briefly, organic dis- 
turbances are those which can be definitely correlated with a struc- 
tural deficiency. In functional disorders, on the other hand, there 
seems to be only a faulty operation or deficient action of apparently 
normal structures.'^ Thus paresis has been definitely traced to the 
influence of syphilitic infection upon the nervous system; a group of 
psychoses have been shown to develop from excessive utc of alcohol 
or drugs; injuries, or lesions, in certain parts of the brain or lower 
nerve centers lead to characteristic behavioral deficiencies. There re- 
main, however, a large nurpber of psychotic conditions for which no 
physical basis has been discovered. These constitute the most common 
types of psychoses, including schizophrenia and the other psychoses 
discussed in the early part of the present section. 

A few psychologists and psychiatrists are of the opinion that the 
physical bases of such behavioral disorders are only undiscovered 
and that ultimately all will be adequately explained in structural 
terms. There is a growing conviction, however, that these disorders 
may be purely functional in their origin, involving no structural im- 
pairment. If this be the case, we should seek the causes of such 
psychotic conditions in the mechanisms of learning and in the environ- 
mental conditions which have surrounded the individual throughout 
his lifetime.’ The question is still a controversial one, to which a 
conclusive answer cannot yet be given. The fact that no differen- 

This is essentially the same distinction as that between structural and func- 
tional determination made in the section on feeblemindedness and m Chapter 4. 

For a vivid analysis of the way in which such disorders might be built up 
through learning, cf. 87. 
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tiating physical characteristics have been discovered cannot, however, 
be attributed to a dearth of data. Investigators have certainly tried 
to find such dift’ercnccs — in the brain or other parts of the nervous 
system, in the endocrines, in the chemical composition of the blood, 
and in innumerable other physical factors — but the findings so far 
have been negative (18). 

One finding which some psychologists have regarded as strong 
evidence for the heredity of psychoses is tliat such conditions tend to 
run in jcuuilics, d he most extensive surveys on this question are those 
conducted by Kallman (44, 45, 46, 47). In the first of tliesc surveys, 
Kailman obtained data on the relatives of 1047 schizophrenic patients 
a.imiticd to the Herzberge Hospital in Berlin between 1803 and 1902. 
One of the results of this study concerns the incidence of schizophrenia 
among parents and children. Within the entire group of families in 
which both parents were schizophrenic, 68% of all the children were 
also schizophrenic. When one parent was schizophrenic and the other 
schizoid, the propojtion of schizophrenic children was 24%*, when 
one parent was schizophrenic or suspected of schizophrenia and the 
other normal, 15%/, and when both parents were themselves normal 
but had schizophrenia somewhere in their ancestry, 9%. Among iinse- 
Icctcd families, the proportion of schizophrenic children would of 
course be about 0.85%, since that is the incidence of tlie condition in 
the general population. 1 licsc results are of considerable interest and 
value in themselves, but they do not, of course, provide any more 
conclusive evidence for cither heredity or environment than do other 
similar data on family reseuiblanccs discussed in Chapter 10. 

In the same study, further data were presented on the half-brothers 
and half-sisters of schizophrenic pau/nts (44). These figures showed 
that when the common parent was schizophrenic, the incidence of 
schizophrenia among the half-siblings of the schizophrenic patient 
was much higher than when the common parent was normal (24% 
vs. 2%). Such data seem to provide a sort of environmental control, 
since the half-siblings had been living in the same family. There are 
diflicullies, however, in these compaii “US. First, the number of half- 
siblings in the two categories was fairly small. Secondly, the com- 
parison is not quite so clear cut as might at first appear, since within 
the group with a common normal parent there would obviously be 
more families in which one or both parents were normal than there 

noimal peisonahty showing mild schizophrenic behavior tendencies. 
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would in the group with a common schizophrenic parent. In other 
words, the total family environment of the group with a common 
normal parent may also have been more favorable and normal. 
Thirdly, the contact between parent and own-child may have been 
psychologically closer than that between parent and step-child. Fi- 
nally, we cannot ignore the possible effects upon the half-siblings of 
their knowledge concerning normal or abnormal parentage; nor can 
we ignore the possible inlliience of such knowledge upon the reactions 
of family and associates toward the sil lings. 

Additional analyses were conducted on twins (46, 47). In iden- 
tical twin pairs in which one twin had schbophrenia, 82% of the 
co-twins were suffering from the same condition. In non-identical 
pairs, the per cent was 12.5, and among siblings 1 1.5. In interpreting 
these figures, all the factors which tend to make the environment 
of identical twins more alike than that of fraternal twins or siblings 
must be taken into account (cf. Ch. 11 ). In a later survey (45) con- 
ducted by Kallrnan in mental hospitals in New York Slate, 691 twin 
pairs were located in which at least one member of each pair was 
schizophrenic. The relative frequency of schizophrenia among the 
co-twins and other relalives of the schizophrenic twanf closely cor- 
roborated the earlier findings. Within the lotal group of 174 identical 
twin pairs, 85.8% were concordant, i.c., both members were diag- 
nosed as schizophrenic. When 59 pairs of identical twins who had 
been separated for 5 years or more prior to the onset of the psychosis 
were compared with the remaining 115 non-separated pairs, the 
incidence of concordance was 77.6% for the former and 91.5% for 
the latter. 

Several case reports of identical and non-identical twins are also 
given by Kallrnan. Among them is the account of a single pair of 
separated identicals, both of whom developed schizophrenia at the 
age of 24 although they had been reared apart since infancy (44, 
pp. 207-209). In contrast to this case is that of a pair of fraternal 
tw'ins, one of whom developed an acute and deteriorating form of 
schizophrenia, while the other remained normal, despite the fact that 
both had been exposed to the same extremely unfavorable environ- 
ment (47). The normal twin in this pair was also physically stronger 
and more mature, a finding which Kallrnan reports f^r the majority 
of twin pairs with only one schizophrenic member. This diflerence 
in physique could, of course, operate as an environmental factor in 
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the subsequent development of the psychotic condition (cf. Chs. 1 1 
and 12). On tlie whole, these data are no more conclusive for the 
question of heredity than most investigations on family resemblances 
and differences. Moreover, the fact remains that if there is a heredi- 
tary basis to schizophrenia, we should be able to show H'hat it is that 
is inherited. 

Similar studies on twins and other family relationships have been 
conducted in the case of other psychotic disorders, such as manic- 
depressive psychoses (cf. 56). In general, their findings arc similar 
to those obtained with schizophrenics, and their interpretations are 
subject to the same limitations which have been discussed above. 
Psychoses do “run in families,’' but more than that we cannot justi- 
fiably CiMicludc. 

Neuroses. The neurc»ses, also known as “psychoncuroses,"’ may 
be regarded as milder forms of peisonality disor^ler than tne psy- 
choses. I hey arc also more generally considered to be functional or 
“psychogenic” in origin than arc the latter. In their specific manilesta- 
tions, they bridge the gap between the slightly maladjusted indi- 
vidual on the one hand and the distinctly ps\ cholic on the other. On 
the basis of the normal dis(ril)ution of behavioral characlcristics. we 
should expect neurotics to !">“ moiC numerous than psyeliotics, since 
they are nearer the center of the curve. 1 his seems to be ciuite clearly 
borne out by actual statistics. Precise data on the incidence of neu- 
roses are dillieult to obtain, h.ovvever, because (ff inconsistencies in 
diagnostic criteria or standards and because most neurcHics arc not 
insdlutiouali/ed. 

b’or conxenienee of classilicalion. neurotic symptoms have been 
grouped into a few major clinical pi- lures, it should be ixu'ne in mind, 
ho>,\cver, tint individual neiio'ties may be quaie unique and that pure 
“textbook cases” aie rare — probably even rarer among rieun tics than 
among psychotics. Among the most commonly employed categories 
are psycliaslticnia, h>:>teria, neurasthenia, and anxiedy neuroses ( J <S, 
19, 56, 61, 63). Tyffictii symptoms of p\ u hastliciua include obses- 
sions (persistently recurring thoiighv . phobias (unwai ranted fears ), 
and compuNions (e.g., continuous and needless washing of hand^). 

Hystcfia is generally characterized by loss or impainnent of bc^dily 

’"This is used here bet lus'" it is very edmnK'Mi iind likely to be 

cneoiiiitered bv the student y\ nuirbei o\ i»s\cb<domsis have been lakne, steps toward 
a much ncciietl Te-.b.ssjfu auon of b-hav-oi d soidcis, but The cLiSsif'Kat.o.i is still in a 
state of flux (ct , , Cameron. 19). 
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function, but in some cases may involve such symptoms as amnesias 
or “trance states.” Among the most typical hysteric symptoms are 
loss of movement in some part of the body and sensory impairments, 
such as deafness, blindness, or skin anaesthesias. These physical 
symptoms of the hysteric, however, are entirely functional in origin, 
and can readily be distinguished from organically caused paralyses 
and loss of sensation. Hysterical symptoms are often anatomically 
impossible, as in the “stocking” or “glove” anaesthesias, which repre- 
sent bodily regions that are unitary onlv in popular thought but do 
not correspond to the known distribution of nerves. Sometimes such 
symptoms arc intermittent and occasional, and may be manifested 
only in the presence of certain individuals or in a particular locality. 
Another distinguishing feature of hysterical symptoms is their sus- 
ceptibility to a wide variety of “cures” based upon suggestion. Under- 
going practically any sort of experience in whose cHicacy the patient 
believes has been known to produce many sudden and startling cures. 
The history of such cases usually reveals a certain obscurity of diag- 
nosis; physicians were bafllcd, the case was declared hopeless or a 
mystery. Needless to say, such a case history adds to the glamour of 
the “cure.” What this really shows, of course, is the functional nature 
of the disorder which naturally defied organic diagnosis and treatment. 
The popular prestige of many unscrupulous charlatans is built upon 
their success in “curing” th^? apparently physical disabilities of certain 
hysterical patients. 

Neurasthenia represents a condition of intense mental and physical 
fatigue induced h\ prolonged emotional jnaladju:,tnient. The indi- 
vidual feels “fagged out” and “a general wreck,” olten suffering from 
insomnia, lassitude, loss of appetite, and phvsical complaints of a 
hypochondriacal nature. Afi\ieiy neuroses are characterized by at- 
tacks of intense fear, together wath the physical symptoms that accom- 
pany this emotional stale. An example is “combat fatigue,” which 
has also be^n classified as a traumatic neurosis, i.c.. a neurosis result- 
ing from a severely disturbing or injurious experience. Recurrent 
nightmares and an exaggerated startle response to such mild stimuli 
as the slamming (>f a door arc characteristic symptoms of this well- 
known war neurosis. 

Neurotic symptoms are no less “real” because they are functional. 
The subject may sutfer just as acutely and be as seriously handi- 
capped as if he had a definite structural deficiency. Similarly, such 
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disturbances cannot be overcome merely by voluntary effort. Nor 
should they be confused with malingering. The subject himself is 
undergoing as vivid an experience as if he had an organic disorder 
and he may be completely unaware of the fact that his symptoms 
have no structural basis. 

Like psychoses, neuroses show little relation to intelligence and 
probably occur at all inlcllectual levels except the lowest (39, p. 975). 
Biographies of professionally eminent persons wiio have developed 
acute neurotic conditions bear witness to the fact that neuroses arc 
not incompatible with very high intelligence.*^ A few studies with 
personality inventories do suggest that the more inteliigent individuals 
arc less likely to give certain neurotic responses (15. 60), but this 
might result from a more so[>histicated response to the test items 
on the part of the brighter individuals. Mi'irecver, dillcrenees in socio- 
economic level, w'hLh are correlated with intellectual stains, may ac- 
count for any observed differences in neurotic tendencies. The latter 
explanation may also apply to possible dillcrenees in the relative 
IrcLjucney of each type of neurosis at various intellectual levels (39, 
p. 975 ). individuals who, by virtue of differences in education, occu- 
pation, and the like, are exposed to different situations are likely to 
differ in the nature of sympioms which they develop. 

It has frecjucntly been said that neuroses arc the result t>f the stress 
and strain of modern living, especially in the more hectic urban and 
metropolitan centers. I'lus may be partly true, but the statistics usually 
quoli d in support of such statements should be examined with con- 
siderable care. With impi vWements in methods ol diagnosis and 
facilities for treatn:ent, many more neurotics arc rccf>gnizccl as such 
lod.<iy than were recognized t\vent\ or forty years ago. Many neu- 
rotics are not sutficienlly maladjusted to attract much attention or to 
demand urgent treat merit. Heretofore, such individuals may have gone 
llieir unhappy vvay, piohably unpc^pular or disliked among their asso- 
ciates, but bearing their dillicultics unlabcled and unrecorded. The 
more highly developed the methods of diagc.osis, the milder will be 
the disorders which can be dcteetL • and the rivirc numerous tlie 
individuals classified as neurotic. The same argument applies to the 
statistics for urban centers, where psychiatric facilities are much 
better than in rural areas (57). 

cp, Leonard, W. i. Ilw Lo< o/nuiivi' Ck^J N. V: Applelon Cenluiy, 
1927. Pp. 434.’ 
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ABNORMALITY IN DIFFERENT CULTURES 

Varieties of Normality. Psychologically, all behavior follows nor- 
mally from its antecedent conditions — there is no essential distinction 
between the mechanisms or psychological principles of normal and 
abnormal behavior. Abnormality is the normal consequence of certain 
stimulating conditions and structural characteristics. Behavior is ab- 
normal only in the sense that it deviates from a norm. This norm is 
determined by the specific conditions of life within a given group. 
Thus it follows that behavior which is considered abnormal in one 
culture may be normal in another. 

Cultural standards enter into the definition of normality in at least 
two ways (31 ). First, the position of the norm and the line of demar- 
cation between normality and abnormality may dilfer from one group 
to another. As a result, any given behavioral manifestation may 
occupy a very different place in different distributiims of behavior. 
'Fo take an illustration from physical traits, if we ask whether a man 
is tall or short, we may obtain very different answers when different 
groups arc employed as standards. The same individual might be 
abnormally tall when referred to the distribution of Weight in the 
Japanese and very short when referred to the Scandinavian distribu- 
tion. Similarly, in certain groups violent displays of emotion arc the 
rule and stolidity wcmld be abnormal. In others, the reverse is true. 
The range of variation over which normal behavior may occur can 
also differ. Thus two cultures having the same norm may diller in 
the degree of deviation from this norm which is possible without 
maladjustment. In one, rigid adherence to a narrowly defined be- 
havioral norm may be required, either because of tradition or because 
of the exigencies of the physical environment. In another, wider lati- 
tude and larger individual differences may be acceptable as ‘'normal.” 

In the second place, culturally established standards may determine 
which emJ o/ the ilistnhntiim is superior and which siilmortnaL Com- 
parative anthropology provides many examples of behavioral devia- 
tions wiiich are regarded as iinadaptivc, pathological, insane, or 
mentally deficient in one culture and are admired or revered in an- 
other. Such behavior may be abnormal in both cases, in the statistical 
sense, but its social evaluation and practical value in the different 
cultures place it at opposite ends of the scale. This point was clearly 
expressed by Benedict (8), who wrote: 
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. . . il is probable that about the same range of individual tempera- 
inents arc found in any group, hut the group has already made its cul- 
tural choice of those human endowments and peculiarities it will put to 
use . . . the misfit is the peiM>n whose disposition is not capitalized by 
his culture. ... It is clear that there is not possible any generalized 
description ot '‘the” deviant — he is the representative ol that aie of human 
capacities that is not capitalized in his cultiire (p. 2-1). 

The same point of view was further elaborated in a later article 
(9) by Benedict as follows: 

One ol these problems i elates to the customary normal-abnormal cate- 
gories and our conclusions regarding them. In how far aic such catcgi^yrics 
C'duirallv determined, or in how ftir can we with assurance ii'gard them as 
ahsolute? Jn how far can ve regard inability to I unction socially as diag- 
nostic of abnoimality, or in how far is it necessary to regard this as a 
function of the culture? 

7\s a matter of fact, one of the rnosf striking facts that emcige from a 
study of widely varying cultures is the ease with which our abiiormals 
function in other cultiiics. It does not matter whiil kind of “abnormality" 
we choose for illusuaiion. those which indicate extreme instability, or 
those which are more in the nature of charaicter traits like sadism or delu- 
sions of grandeur or of pcrwcutioii, there ate well described ciiilures in 
which these abnormals 1 unction at ease and with honor, and appar'ently 
without danger or dUlicultv to the society fp. 60). 

Among the natives of Dobii, an island in Melanesia, fear, sus- 
picion, and mutual distrust characterize the attitudes of tlie entire 
group (32). They lake constant precautions against being poisoned 
or liaving their propcity removed by sorcery or trickery. Within our 
culture such behavior would bo dcHuibed as paranoid, but it repre- 
sents a normal adjustment to the I uTiuan culluic. Illustrations can 
easily be multiplied ( 1 0, 51, 52, 54, 56). 4 he cataleptic seizures 
constituting an important part of the behavior of the Siberian shaman 
and the homosexual practices common in many American Indian and 
Siberian communities represent other ilkislrations. Trance stales arc 
a normal part of the bchaxior repertory of certain American Indian 
groups, and it is Xhc individual who is .mable to experience the trance 
who is the deviant 2'^ Epileptic seizures, excessive daydreaming, and 
withdrawal characterize the superior deviant in certain cultures, rather 

’'•H'f, c ir . Ilie interesting biography of Mi«'h a deviant recoided in Itadin. P, ed. 
Crashini^ Thunder; the AntohiUf^iuphy of an American hidiun. N. Y : Appleton- 
Century-Crofts, l‘J26 Pp. 202. 
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than being a source of maladjustment. To be sure, the “significance” 
of such behavior for the individual difl’crs from that in our culture. 
But this is just what we mean by saying it is normal in one culture 
and abnormal in another. In one case, the individual is behaving in 
a manner which is sanctioned and overtly encouraged by his culture; 
he is conforming to the accepted and institutionalized pattern. In the 
other case, he is not.-^^ 

Varieties of Abnormality. All cultures have their deviants and 
their maladjustments. But the form which such maladjustments take 
may vary widely with the cultural setting (4, 10, 51, 52, 56). In the 
wincligo psychosis among the Ojibwa Indians, the individual believes 
he has been transformed into a windigo, a mythical cannibalistic giant 
made of ice (55). The condition usually begins with a state of de- 
pression and often develops into violence and compulsive canni- 
balism, in which the individual may kill and eat the meiTibers of his 
own family. Other familiar examples include arctic hysteria, found 
in northern Siberia, in which the individual sliows a high dcgiee of 
suggestibility and compulsively imitates the words and actions of 
those in his vicinity. A similar condition found among the people of 
Malay is known as latiili (70). Also characteristic of«t!ie Malayan 
culture is amok. I he person who "Tuns amok” attacks in a blind rage 
everyone he meets, frequently injuring or kiliing many before he 
is stopped. The innuence cultural factors upon the specilic nature 
of deviant or maladjusted behavior was also illustrated by a survey 
of the neuroses observed among native African troops during World 
War 11 (65). 'l lic relative frequency of certain types of symptoms, 
such as phobias and hysterical symptoms of a motor or sensory 
nature, and the almost complete ab>encc of other conditions, such 
as anxiety stales, could best be understood in terms of the particular 
tribal beliefs and traditions. 

We need.not go to peoples for illustrations, but can find 

them in our own cultural history (cf. 52). \ he dancing manias which 
swept over whole villages in the Middle Ages arc a form of neurotic 
behavior having no direct counterpart today. Many of the mani- 
festations of witchcraft provide further illustrations, d'hc trance states, 

lio Wegrocki (88) has ar^ticcl against the cultural and statistical concept of 
abnormality on the giounds that the .same behavior may be indicative of severe 
mal ad Ills! me lit in one ciilliue and ol good adinstment m anolhei. Far fiom being 
a criticism (if the cultural concept of abnormality, this lollows directly from it, as 
indicated in the above discussion. 



The Subnormal 


569 


the hysterical insensitivities such as the “devil's claw” — an insensi- 
tive spot on the skin often used as “evidence” in witchcraft trials — 
were all part of a clinical picture which fitted into the culture of its 
time. A further example of such “fashions in abnormality" is the deli- 
cate, languishing type of illness oi unknown origin which was so 
common among Victorian gentlewomen. 

AHNORMALtTY IN INFRAUUMAN ORGANISMS 


Other species have their deviants too. Mental deficiency, as well as 
‘‘unadapti\e" behavior which can be characterized as psychotic or 
neurotic, has been noted in many tinimal forms. Homosexuality has 
been observed or experimentally induced among doves, pigeons, 
guinea pigs, white rats, and monkeys (cf. 36. 4i ). Several investiga- 
tors working with i-ionkeys have reported instances of othci types 
of abnormal bchtivior, such as habit residuals, temper tantrums, in- 
fantile reversions, and various forms of sexual perversions (30, 36, 
84). These constitute abnormalities in the sense that they difi'er con- 
spicuously from the usua.l behavior of the species. Whenever the 
etioU^gy of such abnormal behavior could be definitely traced, experi- 
ential or environmental factors were found to play a predominant 
part in its development (30, 36, 41, 84). 

In the course of his conditioning experiments with dogs, Pavlov 
(67) observed several instances of distinctly neurotic behavior. Such 
behavior appeared when the animal was required to make too fine a 
sensory discrimination, or ' j set up loo many conditioned reactions 
whthin a short time, or to establish a conditioned reaction when the 
two stimuli were separated by toc' long an interval. The neurotic 
behavior included violent emotiona: display and loss cf previously 
established discriminations. These early observations of Pavlov stim- 
ulated a number of investigators to explore the problem further, and 
the production of *\'xpei'imental neuroses'’ in animals has become 
a common research procedure (59). Several modifications of the 
Pavlovian conditioning technique h * been employed, as well as 
a number of conllict-producing situations. The neurotic behavior 
observed varies from sleepiness, inertness, and rigid immobility to 
hypersensitivity and ovcractivity, sometimes reaching manic excite- 
ment. Symptoms resembling chronic anxiety, phobias, infantile regres- 
sion, compulsions, and hallucinatory phenomena have been described. 
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From earliest times, man must have been aware of the genius 
in his midst. In order to be recognized as a genius, the individual must 
display an unusual degree of the talents demanded by his culture. 
Since only the extreme deviates attract notice, they seem by the very 
rarity r»f their attainments to stand off from the rest of mankind and 
to constitute a distinct group. With the advent of more objective 
methods of observation and the development of testing techniques, 
the presence of lesser deviates who bridge the gap between the aver- 
age man and the person of rare gifts has been demonstrated. The 
popular concept of genius as a separate ‘'species’' probably arose in 
the same fashion as the similar belief regarding the feebieminded, and 
it is slowly being dispelled by the same methods. 

The relationship between genius and eminence is a curious one. 
Many writers identify the two by the simple expedient of defining 
genius as the possession of “what it takes’’ to become eminent in our 
society. The eminent man is then considered a genius ipso fario. 
There would thus be as many kinds of genius as there arc ways of 
succeeding in the particular society. The successful financier, for 
example, may be awarded an honorary university degree for his 
“financial genius,” the victorious general for his “military genius.” 
Society often creates a new form of “genius” in order to rationalize 
its allotment of eminence. 

Almost any theory regarding the nature of genius could, of course, 
be defended by restricting the term '‘genius” in some arbitrary way. 
The broadest and most objective definition of genius is that of an indi- 
vidual who excels markedly the average perjormance in any field. 
Social evaluation, however, invariably enters into the concept. Genius 
is defined in terms of specific social criteria and a cultural fiTimc of 
values. In our society the more abstract and linguistic abilities are 
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considered the “higher'’ mental processes. Similarly, certain lines of 
achievement enable the individual to earn the appellation of genius 
much more readily than others. I'hus academic and scientific work, 
literature, music, and the visual arts arc rated higher than, let us say, 
roller skating or cooking. 

To be sure, very exceptional accomplishments in the latter fields 
might be recognized as genius, after a fashion. An internationally 
famed roller-skate acre)bat or a renowned chej-dc-cuisinc might be 
called a genius and ranked higher than a mediocre scientist or painter. 
But in the iormer instances, the aitninnicnis itiusi he proportionately 
jar eater than in the latter in ortler that the individual may be desig- 
nated a genius. And even \\hen the term ''genius’' is applied to such 
cases, one feels that it is done only by courtesy and that the word 
is implicitly enclosed in quotation marks. It is apparent, therefore, 
that in order to ha\e practical meaning any delinition of genius must 
recognize the selection of significant talents which has been made 
within a given cultural group. 

A further question winch has been vigorously debated is that of 
t^eneral vetsiis specific genius. Is the man of genius one who manifests 
a well-rounded iiitelleciual superiority c')r one who possesses a highly 
speciali/ed gift? It follows freun what we know about the organiza- 
tion of abilities that this distincticm is not a v*iliu one. Since the 
intercorrelations of diverse abilities are neither liighly positive nor 
highly negative, we should expect all degrees of generality of genius. 
A few individuals may excel highly in a large number of traits and 
tlius appear to be all-arcu nd geniuses, as in the classic example of 
Leonardo da Vinei. Some will excel in only a lew trails, and still 
others may have a single talent w k'h is suliicienlly pronounced to 
put them in the category of genius. 

THbORirS ON TUI NATfRI- OT CibNIOS 

Theories on the nature and causes of genius are legion. The genius 
has been credited wuth a wide va iefy of attributes, ranging from 
divine inspiration and a superhuman ‘'spark” to imbecility and in- 
sanity. Among these diverse theories it is possible to discern four 
underlying viewpoints. These will be designated the patholoi^ical, 
psychoancilytic, qualitativesuperionty, and qiiantitalive-sapenority 

theories. 
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Pathological Theories. Pathological theories ^ have linked genius 
with insanity, “racial degeneracy,” and even feeblemindedness. Such 
theories dale back to ancient Greece and Rome. Aristotle noted how 
often eminent men displayed morbid menial symptoms, and Plato 
distinguished two kinds of delirium: one being ordinary insanity, and 
the other the “spiritual exhalation” which produces poets, inventors, 
and prophets. The juror poeticus and ainahilis iusanici of the Romans 
had reference to the same phenomenon. Demoeritus was among those 
who argued for such a relationsliip. it was Seneca who inspired 
Dryden to write his well-known line regarding great wit and madness 
being near allied. Lamartine spoke of the "inuhulie jucntale qu'on 
appellc genie/' and Pascal maintained that 'TcAlrcine espiit est voisin 
de rexneinc jolie/' In 1836 Leiui shocked the literary world by de- 
claring that physiological evidence furnished by the life of Socrates 
left no doubt but that the "‘father of philosophy'’ was subject to 
trances, attacks of catalepsy, and to false perceptions and hallucina- 
tions, constituting what Ixlut termed “sensorial or perceptual mad- 
ness.” Ten \ears later, Lelut reached a similar conclusion about 
Pascal, calling attention to the latter's religious visions and hallucina- 
tions. This early work of I-elut provided an important ♦>tinuilus for 
later theories of genius and insanity, as well as for a host of other 
similar analyses of the pathological traits of eminent men. 

The latter half of the nineteenth century was the golden age of 
pathological thec'ries o( genius and witnessed the publication of many 
weighty tomes on the subject. Some of the leading exponents of the 
period were Winslow, Moreau de Tours, Mobius, Nisbet, and Nor- 
dau. This viewpoint reached its culmination in the work of the 
Italian anthn'pologist Lombroso (37, 38). His bot)k entitled The 
Mail of (jenius W'as translated into several languages and read widely 
at the turn of the picsent century. Lombroso attributed to the genius 
certain pliysual siigwaia, allegedly indicative of atavistic and degen- 
erative tendencies. Among such stigmata he mentioned short stature, 
rickets, excessive pallor, emaciation, stammering, left-handedness, de- 
layed development, and originality! He also maintained that there 
were certain similarities between the creative act of genius and the 
typical epileptic seizure. 

Among modern exponents of modified versions of the pathological 

^ For a survey of this extensive literature, wth special releicnce to literary and 
aitistic genius, cf. Anasiasi and Foley, 1, pp. 65 fV. 
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theory of genius, the most outstanding are probably Kretschmer (33) 
and Langc-Eichbaum (34). The former has maintained that for true 
genius exceptional ability is not enough. He writes, “Jf we take the 
psychopatihe factor, the ferment of demonic unrest and psychic ten- 
sion away from the constitution of genius, nothing hut an ordinary 
gifted man would remain” (33, p. 28). Jn addition, Kretschmer 
applies his constitutional typolog\ (cf. Ch. J2) to the problem of 
genius, arguing lor a qualitative distinction between the achieve- 
ments ol leptosoine and pyknic geniuses. 1 he schi/(Hh\nne leptosome, 
he claims, will tend toward subjectivity, as in lyric poetry or expres- 
sionist art; the cyclothymc pyknic, on the other hand, allegedly in- 
clines more toward realistic painting, narrative epic poems, and 
the like. 

I he most extensive modern contribution to tlie pathological theory 
of genius has undo'ibtedly been made by Lange -Eiebbaun, (34, 35). 
In his Genie, Irrsinn inul Rnhm, published in 1928. he brings together 
the biographies of 200 men and women of genius from all countries, 
peric^ds, and fields of endea.vor. All these biographies contain refer- 
ences to alleged abiKumialities of their subjeefs. I lie reports are fully 
doeumemed with a bibliography of over I (>00 references, but vary 
in length Irom several pages to the simple^t comment such as “for 
a King time psychotic.” Lange-r’ichbaum grants that there is not an 
in\ariable or necessary association of genius wn(h insanity. At the 
same lime he insists that those geniuses who have not sulfcred from 
mental abnormalities are few. Among liiis .small minority he cites 
Titian, Raphael, Andrea dc! Sarto. Riiliens, l.eibnil/, and a few 
others. From his sur\cy he conclutles that although the proportion 
of the general population who arc psychotic is about 0.5%, among 
geniuses 12% to 13%, have been [ syehotic at least once during them 
lifetime. Confining his analysis to the 78 “greatest n.irnes” in his list, 
he finds that more than 10%? psychotic once in their life- 

time. More than 83%' i^^ive been either psychotic or markedly psycho- 
pathic, more than 10% slightly psychopatni. , and about 6.5%, healthy. 
When only the 35 names representing “the greatest geniuses of all’' 
were selected, 40% fell into the psy Jiolic category. Over 90% were 
characterized as either psychopathic or psychotic, and about 8.5%? 
normal. 

Langc-Eichhaum’s explanation of the association of insanity and 
genius is threefold. First, the pathological condition is said to increase 
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the strength of the individual’s emotions and his responsiveness to 
minute stimuli, and to decrease his self-control — all of which may 
result in experiences which “normal” persons do not have. Secondly, 
Lange-Eichbaum maintains tliat those suffering from these conditions 
are likely to experience more unhappiness and feelings of inferiority, 
which motivate them more strongly. Finally, the tendency to a richer 
fantasy- and dream-life, associated with some of these disorders, may 
be conducive to creativity of expression. 

In evaluating the evidence cited in support of pathological views 
of genius, several factors must be taken into account. First, in most 
of the studies, the evidence consists of selected cases. Some indi- 
viduals could, of course, be found to illustrate almost any theory. 
The real test of the hypothesis must be based on a completely un- 
selected sampling of geniuses. The survey of l.ange-Fichbaum is 
probably less subject to such selective factors than many other such 
studies, but it is not entirely free from them. 

A second point is that man> geniuses may become malad|iisted 
in a society built up around the average man and his needs, dhis is 
particularly noticeable in (he case of a very superior child placed in 
a class of mediocre school children. It is profiably tru^ of supenor 
adults too. In such a ca^e, the maladjustment would be an induce t 
result rather than a caioe or an essential component ol genius. A 
different although related consideration is that the genius, by viitue 
ot liis superior abilities, may be more keenly aware of shortcomings 
and injustices which he observes and thus subjected to more emo- 
tional “wear and tear."’ Jt has been said that a sensitive and imagina- 
tive person cannot Inc as calmlv as a storekeeper (6! ). 

Cjeniuses, moreover, arc olten reeaided as pathological by their 
fellow -men until the inaictical benefits ol their woik become tangible. 
Their undertakings are often misunderstood or ridiculed until tiieir 
.’success is demon' trated. 1 he familiar example ol Thilton and his 
steamboat is a case in point. In the ptisU the genius has at times met 
with organized and violent opposition or even peisecution I ife under 
such conditions is not very conducive to the devek'pment of a stable 
and well- ad justed personality. It should also be noted that, even wlien 
the genius is rccogni/cd and acclaimed as such, he is likely to be 
surrounded by such a glare of publicity that all his actions and idio- 
syncrasies become common knowledge. As a result, any behavioral 
deviation too slight to attract attention in a less outstanding individual 
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is pounced upon, discussed, and elaborated until it may assume the 
proportions of a neurotic or psychotic symptom. Finally, the cultural 
setting in which the particular man of genius lived must be considered. 
It is misleading to evaluate the behavior of a thirteenth- or sixteenth- 
century genius in terms of present criteria of abnormality. Trances 
and visions, for example, were not so unusual at one time in our 
history as they are today, nor did they have the same signihcance. 

Psychoanalytic Theories. In common with the more recent modi- 
fications of pathological theories, psychoanalytic conceptions of 
genius emphasize motivational rather than iiiiellectual characteristics 
(15, 22). Although admitting that a high level of ability is essential, 
some psychoanalvsts regard this aspect of genius as a ‘'psychological 
riddle'’ (19) and concentrate upon motivational factors. Others have 
taken the more extreme position dial the genius docs not dilTcr in 
ability from the ordmarv man, but diflcrs only in what he docs with 
his ability under strong motivational urges ((>3). Among the psycho- 
analytic concepts which have been most frequently applied to an 
explanation of genius arc subimiation, compensation, and “uncon- 
scious processes” in creative production. 

By sublimation is meant that the artistic or scientific achievement 
serves as a substitute outlet for thwarted drives, often of a se^iual 
nature. The familiar illustration of the poet who composes a love 
lyric when he is frustrated in love comes to mind. But many of the 
specific cases to which some psychoanalysts have tried to apply this 
mechanism are much more far-fetched and seem rather forced. Com- 
pensation for real or inio^ined infei iorilies has likewise been pro- 
posed as the principal clue to the aceompli'^hmcnts of genius (63). 
A favorite illustration is that of gre < orators who, like l^cmoslhcncs, 
developed their talent as a compensaiion for an initial habit of stam- 
mering or a similar speech defect. It has also been suggei^ted that 
Beethoven compcjsed his greatest works after he became hard of 
hearing, and tliat he probably had a hearing defect even in early 
life. As a result, his interests were allegedly centered upon auditory 
experiences from an early age and began a regimen of intensive 
training which culminated in his ouistanding musical achievements 
(63, p. 119). Like sublinvition, compensaiion can probably help us 
to understand the motivation of some geniuses, but it should not be 
applied indiscriniiiiatcly to all cases. 

A number of cre:itive worker.^, especially artists, have provided 
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accounts of their own creative experiences. Some of these accounts 
refer to production under trance-like states and to the automatie, 
apparently uncontrolled appearance of creative ideas. This the psycho- 
analysts have regarded as evidence for their theory of the importance 
of “unconscious processes” and the part which such processes play 
in creative work. The number of persons who have written such 
introspective accounts is, of course, small in comparison with the 
total number who have achieved eminence in art, science, and other 
fields of endeavor. Artists, by the very nature of their profession, arc 
more likely to dramatize their own experiences than are other types 
of creative workers. A sobering contrast to such dramatized accounts 
is provided by the results of Rossman’s inquiry among 710 active 
and successful American inventors (49). This inquiry, which was 
supplemented with information obtained from research directors and 
patent attorneys, covered both the characteristics of inventors and 
the nature of the inventive process. No part of this study lent any 
support to the popular notion of invention as a speclaeular event. 
For this group c'f inventors, the creative experience was on the whole 
a very methodical, systematic, and matter-of-fact process. 

F:ven among artists, those who have spontaneoi^ly written accounts 
of their own creative experiences may be a rather atypical group. It 
is like!) that the more unstable, pathological individuals have, on the 
whole, been more interested in recording such observa.ti(^ns, just be- 
cause their experiences were more unusual and nevvs^vo^thy. I'hc 
recoids arc far from factual or objective, and any preconceived the- 
ories which the individual himself may have had could have colored 
the original account. Finally, it should be noted that many of the 
ps\vhoanalytie interpretations of the creative process as well as of 
the nature of genius are vague, confused, and mcntalistic, often mix- 
ing literal and iigurative concepts indiscriminately. 

Theories of Oisaiitntive Superiority. According to the doctrine of 
qualitative superiority, the man of genius is a distinct type dilTering 
from the rest of the species in the kind of ability he possesses. Such 
views can be distinguished from the pathological and the psycho- 
analytic in that they regal'd the man of genius as essentially superior 
to the norm. No inferiorities of any sort arc implicit in this con- 
cept. The achievements of genius, according to these theories, 
result from some process or condition which is entirely absent in the 
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ordinary man. Such current expressions as “the spark of genius” 
reflect the popular influence of this point of view. 

This approach, like the pathological, has a long history (cf. 23). 
In the ancient world, genius was frequently attributed to divine in- 
spiration. The Greeks spoke of a man’s 'Tkenion” which was sup- 
posed to possess divine penvers and to furnish the inspiration for his 
creative work. Among those who discussed genius in these terms are 
Plato and Socrates. During the Middle Ages, genius was often re- 
garded as the inspiration of a chosen mortal by llie deity or by a 
devil, the attribution depending upon the use to which the creative 
talents were put. 

Qualitative distinctions are also common in more recent literary 
and philosc'jphical writings on the subject of genius. Mvstic insights 
and unconscious intuitiems have been attributed to the man of genius. 
In this connection may be mentioned the views of Sclaopenhaucr, 
Carlyle, and linierson. In psychological discussions of genius, this 
point of view is much less common. An example is the theory pro- 
posed by Hirsch (23), in winch he diflerentiated three “dimensions” 
of intelligence. According to this theory, the first dimension is per- 
ceptual and cognitive and is shared by man and the lower animals; 
the second is conceptual and is common to all of mankind; the third 
he designates “creative intelligence” and attributes only to genius. 

Qualitative distinctions appeal to the imagination ol the public. 
The genius whom the layman acclaims dilTers so greatly from the 
rest of mankind in his achievements that he seems to belong to an- 
other species. A careful a ally is of the individual’s abilities, however, 
will reveal no essentially new' process. And only a brief unbiased 
search discloses the presence of irtermediate degrees of capacity in 
all lines. 

Theories of Otiantitative Superiority. The view that genius in- 
volves a quantitative superiority regards the genius as the upper ex- 
treme of a continuous distribution of ability. 1 he “special gifts” and 
“creative powers”' of genius are attributed, to a lesser degree, to all 
individuals. Genius is defined in i-rms of concrete, measurable be- 
havior rather than in terms of un. rii)wn entities. To be sure, the 
accomplishments of genius are not attributed to any single talent, but 
to an auspicious comlanation of various intellectual, motivational, 
and enviionmental factors. 
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It follows from this view that the origin of genius is to be under- 
stood in the same terms as that of all individual differences. Many 
investigators, such as Galton (20, 21), ferman (55), and L. S. 
Hollingworth (24, 27), have placed the major emphasis upon hered- 
itary factors. The observation that genius tends to run in families has 
probably given the greatest impetus to such a hereditary interpreta- 
tion. The powerful environmental influences exercised by family con- 
tacts and traditions cannot, however, be overlooked. In the sections 
which follow, we shall examine specific findings on genius for what- 
ever bearing they may have upon the various theories. 

METHODS rOR I HE STUDY OE GE NIUS 

Psychological investigations on the nature and development of genius 
have followed two fundamental approaches, viz., the study of adults 
who have achieved eminence and the study of gifted children. Hit 
specific pr(')ecdurcs may be further subdivided into: ( 1 ) biographical 
analysis, (2) case study, (3) statistical survey, (4) hisloriometry, 
(5) intelligence test survey, and (6) longitudinal study. Although 
any one investigator may, and frequently does, combii^e more than 
one specific method, we shall consider them independently for clarity 
of presentation. 

In biographical studies, all available published material on a given 
individual is examined in the effort to arrive at an understanding of 
the nature and origin of his genius. The investigation is limited to a 
single individual, who is usually chosen from the great men of the 
past. This method has been employed extensively by psychoanalysts, 
as well as by the exponents of pathological views of genius. The 
literature on this method runs to several thousand references 
(cf. 1, 34, 35). 

1'he rase study method consists of direct testing and observation 
of a single living individual. Because of the difficulty of subjecting 
adult geniuses to such an investigation, this method has been applied 
almost exclusively to gifted children. Several such studies on con- 
temporary “child prodigies,” including a number on juvenile authors, 
have been conducted by psychologists. 

The statistical survey method, like the biographical, is based upon 
an analysis of printed records, although differing from the latter 
method in several essential respects. The purpose of statistical sur- 
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veys of genius is to discover general trends in a large group, rather 
than to make an exhaustive analysis of a single case. All available 
information on a large number of men is obtained from biographical 
directories, encyclopedias. Who's Who. and similar sources. Ihis ma- 
terial is occasionally supplemented from biographies. But the former 
sources are employed predominantly because of the more objective, 
reliable, and standardized nature of their data. It will be noted mat 
in ihis method the criterion of genius is chieily eminence. 

The historionwiry method makes use of ail historical material on 
an individual or a group ol individuals. The data are gathered from a 
variety of sources, including biographies, directories, and original 
joeiimem*- such as letters and diaiies. d'he attempt is made to obtain 
as complete information as possible, espcciallv on the childhood 
aecompiishmenis of the great man. 'This ir/iieriai is then evaluated 
in terms of a more or less constant slandaul in order to anive at an 
estimate of the individuars traits. This method was employed by 
Woods (71) in his study of mental ami moral heiedily in royalty. 
Terman (51) sub^equclUly suggested an adaptation of hisloriometry 
wTiercby the recorded achievements are evaluated in terms of mental 
test norms ior each age and an 10 is eompiili\i. By this method, for 
example. Terman estimated that the JO of Francis Galton in child- 
hood w^as appioximately 2t)0. 

riie intelligence test survey involves the direct study of large 
groups of intellectually superior children by means of mental tests. 
Txtensive use is now being made of this method. The subjects are 
originally selected on the .^asis of intelligence test performance, and 
subsequent analyses arc made with the aid of standardized intel- 
lectual, educational, and personal v measures. A relatively recent 
development is the loneitiulinul stu^o', in which a group of children, 
originally selected because of high 10, arc followed up irUc^ adoles- 
cence and adulthood. 

Each of those procedures has its own peculiar advantages anu 
disadvantages. No one can be regarded best or poorest on all 
counts. The statistical, hislonomct.y and intelligence test methods 
can be applied to large groups, ana hence disclose general trends. 
They are also relatively free from selective bias, yielding fairly repre- 
sentative samples. The biographical and case study methods, on the 
other hand, give a more complete picture ol the individual and enable 
one to note the specific interaction of various conditions in the sub- 
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ject's development. The study of contemporary living geniuses makes 
direct observation possible and avoids the judgment errors and other 
inaccuracies which are inevitably present in historical material. At 
the same time, carefully controlled observation of living geniuses 
offers many practical clilliculties. A further disadvantage in the study 
of contemporaries is the possibility that the eminence of some may 
be short-lived and spurious and that others who arc laboririg in 
obscurity may be recognized as geniuses by posterity. 

Finally, the relati\'e advantages of studying adult geniuses and 
gifted children may be considered. I’o inv estigate intellcctualiy su- 
perior children in the effort to discover the characteristics of adult 
geniuses seems somewhat indirect. Only a small number of such chil- 
dren are likely to develop into adults who can be classified as geniuses. 
Chilcli'cn, however, are available for prolonged and controlled obser- 
vation and testing which would be practically impossible wi(h adults. 
A further advantage of the study of gifted children is that it makes 
possible a developmental approach to the problem. Such an analysis 
may go far toward clarifying the origin and nature of genius. 

statistic al SURVHVS OV LMINl-NT MI N 

Investigations of genius through statistical surveys of piinted records 
have been conducted in Hngland by Gallon {20, 21 Ellis (18), 
and Bramn’cll (4); in France by dcCandolle (14), Jacoby (32), and 
Odin (45); and in America by Cattcll (9, 10, 11). Brimhall (5), 
Clarke (12), Bow^erman (3), and Vislier (62). Castle (8) conducted 
a similar survey cm eminent women of all countries, but the data of 
this study are extremely tentative and diflicult to interpret. We shall 
examine briellv some of the principal lindings of these various surveys. 

The so( io-econoniic background of eminent men has generally 
proved to be distinctly above average. The genius who has been 
nurtured in a slum is the exception rather than the rule. Thus in 
Visher’s analysis of the occupations of the fathers of 849 “starred’’ 
American men of science," nearly half were found to be engaged in 

^ The “still red” men represent the most eminent persons listed in the Diteitory 
of Ajiiencan Men of Science Those to he starred in each field of science are <, hosen 
on the basis of nominations by scientists who had previously been starred in that 
field. The original IdOO starred men were selected in UfO.L and 250 additions were 
made in each new edition of the directory, prepared every live years. 
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the professions. This proportion is far in excess of that in the general 
population, the latter falling between 3% and 6%. The entire occu- 
pational distribution of the fathers of the starred men is given in 
Table 31. 

TABLE 31 Occupational Distribution of 
luitlicrs of S49 Starred Anicrican Men 
of Science 


( '1 Vi-1u*r, 62, p, 5V>) 


1 

Occupca ional iiump 

Per Cent 

Professions 

45.5 

Business and mcrcaiitile 

23 

Farming 

22 

Skilled labor 

S 

Unskilled labor 

1 


A similar occupational distribution is 1o be found among the 
fathers of the eminent men and women surveyed by f'llis (18). In 
Castle's study of eminent women of all limes and nationalities, it was 
reported that 33.17^ had fathers in the “learned professions’" (8). 
Tlie distribution of paternal i)ceupation found by Cox (13) in a 
group of 282 eminent men and women of all countries is shown in 
Table 32. In this group, which covered a much earlier period in his- 
tory ( 14.'>()-I850), tfie predominance of high socio-economic level 
is even more conspicuous. 

'FABLI*^ 32 Occup(flional Disti ihution of Fatturs of 282 
Eminent Men and Women of All Countries 


( I'l oiii ( \ p Cl 


OentpumnuA Cnoup 

Pei ('cut 

1. Professional .and nobility 

52 5 

2. Semi-professional, higher business, and gentry 

28.7 

3. Skilled workmen and lower business 

13.1 

4. Semi-skilled 

3.^ 

S Unskilled 

1.1 

No record 

0.7 
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The number of eminent relatives may also be considered in this 
connection. It will be recalled (Ch. 10) that in Gallon's study (20) 
the 977 eminent men investigated had a total of 739 known relatives 
who had also achieved eminence. Moreover, the closer the degree of 
relationship, in general, the more numerous were the eminent rela- 
tives. A follow-up of Gallon's study, covering three subsequent gen- 
erations and reported in 1948 by Bramwcll (4), closely corroborated 
Gallon’s findings on the frequency of eminent relatives. Similar re- 
sults were obtained in Brimhall’s investigation (5) of family resem- 
blance among American men of science. 


TABLE 33 Proportion of American Men of Science 
Born in Lastern and Midwestern States 



( Ki (ni ( 'al ( 11 
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Massacliusctts 

134 

72 


Co mice lie lit 

40 

16 


New York 

183 

128 


Pennsylvaiua 

66 

48 


Illinois 

42 

88 


MinncsoU) 

4 

32 


Missouii 

14 

40 


Nehr.'i'ik.i 

2 

20 


Kansas 

7 

32 

L. 





Certain interesting trends are suggested by Cattcll's analysis of the 
place of hittli of American men of science (cf. 9, 11). In iiis 19t)6 
report. C.atteil pointed out that cities contributed a much greater 
proportion of men of science than did rural sections. Although at 
that time the urban population was about one-sixih of the rural 
population, it produced a quarter of the scientific men. Even more 
striking is the comparison of different states which varied widely in 
their educational facilities. In 7 able 33 are shown the relative num- 
ber of scientists born in each of nine stales. These stales were chosen 
as the clearest examples of a definite trend which had been operat- 
ing over an interval of three decades. Corresponding figures are 
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shown for the original group of 1000 scientists selected in the year 
1903 and for the group of 250 elected in 1932. All figures have been 
expressed in terms of 1000 entries to permit direct comparison. 

These data suggest several conclusions which arc borne out by 
the complete results for all parts of the country (cf. 11 J. In the first 
place, there arc marked discrepancies in the relative number of 
eminent scientists born in dillereiit parts of the country. Secondly, 
these differences in birthplace correspond closely to diifcrcnces m 
educational opportunities in various sections of the country. Thirdly, 
as educational facilities change, the frequency of scientists shows a 
corresponding change. Since the turn of the century, for example, 
there has been a phenomenal development of education in the mid- 
western states. The relative (juality of education in such states has 
improved, new universities have been established, the contribution 
of state and federal funds to higher education has mounted sharply, 
the number of students in institutions of higher learning has increased 
rapidly, and a powerful tradition has been built up which fosters 
intellectual activity. On the basis of such findings alone, we cannot, 
of course, draw any inferences regarding the relative contributions 
of hereditary and environmental factors. Whether there has been a 
selective migration of intellectually superior families from New Eng- 
land to the midwestern states, or whether the improved educational 
facilities have been conducive to the development of more scientists — 
or whether hoih of these intluciic’cs have been operating — cannot be 
conclusively determined from the available data. 

Of interest in connection witli the pathological theories of genius 
is the relative frequency of iiisaniiy among the relatives of eminent 
men, as well as among the subjects themselves. In all statistical sur- 
veys in which the cases were not selected to prove a point, the inci- 
dence of intellectual and emotional disorders has been found to be 
consistently smaller among eminent men and their lamilies than in 
the general y)opulation. In the group investigated by Ellis (18, p. 192), 
less than 2% were repotted to have liad ciihcr insane parents or 
insane offspring. Among the eminent individuals themselves, Ellis 
mentions 44 cases of emotional disord-T out of a total group of 1030. 
Of these, only 13 could be definitely classed as insane during the 

^Cf., e.i?., Eclls’ analysis of the “center of population” of higher education from 
1790 to 1920, which showed a westward movement at the rale of 60 miles pei 
decade (17). 
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active period of their lives; 19 were either insane for a short period 
or manifested very mild disorders; and 12 developed senile dementia 
in old age (cf. 18, pp. 189-190). 

Other facts which have been brought to light by these surveys 
relate to age of parents at the time of birth of the child, order of birth, 
and similar “vital statistics.” It has been suggested, for example, that 
intellectually superior children arc more often born of older parents 
(48). From a somewhat different angle, Lombroso (38) claimed that 
geniuses arc the offspring of aged parents and ofl'cred this as further 
evidence of the pathological nature of genius. The data on this ques- 
tion arc dillicult to interpret because of the complicating factor of 
social level. People in the higher social classes, from which geniuses 
are most frcqiiently recruited, tend to marry later and therefore have 
children at a later age. They also tend to have fewer children, who 
thus benefit all the more from educational and other socio-economic 
advantages. For all these reasons, parental ages arc in themselves 
inconclusive. Among American men of science, Cattell (10, III) 
found 35 >'ears to be the average age of the father at the time of the 
subject's birtli. Fur English men of science, Gallon (21) found the 
corresponding ligurc to be 36 years. Ellis (IS) gi\^''S 37.J years for 
his group of Biitish men and women of eminence. In all ilicsc groups, 
however, the range (4* parental ages at the time of the subject's birth 
is extremely wide. In tlie majority of cases the parents were in the 
prime of life, contrary to Lombro^o‘s contention. 

Somewhat more conclusive is ihc analvsis of order ol birth within 
the family. In genera! the eminent individiuil is n:osl oiten the oldest or 
lirst-born child in the Eunily, Nevt in older of frequency cuaics the 
voungest child, intermediate cJiildren liaving the least chance of be- 
cenning eminent (ef. 18, 72). T hese findings are in direct contradic- 
tion to the propo‘‘ed tlieory that older paiLnb^ ha\c intellectually 
more gifted olfspiing. It would seem that, within the same family, 
the superior cIEdd is most likely to be born when tiic parents are 
younger. 1 his Ending may have an environmental explanation. The 
first-born has traditionaliy enjoyed privileges in our society that his 
younger siblings may not have haci. More is usually expected of llic 
oldest son. If a choice must be made for economic reasons, {.]\c oldest 
child is usually allowed to complete his education, in preference to 
the younger children. These conditions miglit be sulheient to produce 
a slight degree of relationsliip between birth order and achievement. 
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Motivational factors in sibling relationships may also play a con- 
tributing part, as may the fact that the tirsl-born probably receives 
more adult attention. The latter is particularly true of only children, 
who would all be classified as “first-born.” 

IIISTORIOMETRY IN THE ANALYSIS OF EMINENCE 

The childhood of great men, viewed retrospectively, has been the 
source of much controversial discussion. There is a popular belief ‘ 
that many geniuses were dull in childhood, a number of favorite ex^ 
amples being cited in support of this contention. Darwin was con- 
sidered by his teachers to be below average in intellect. Newton was 
at the bottom of his class. Heine was an academic failure, revolting 
against the traditional formalism of the schools of his time. Pasteur, 
Hume, von Humboldt, and other equally famous men were unsuc- 
cessful in their school work. 

An examination of the available biographical material in such cases 
shows that the intellectual defect was erroneously inferred from the 
level of scholastic performance within a rather narrowly restricted 
area. The intellectually superior child may be just as maladjusted in 
school as the dull or borderline case. Schools adapted to the average 
child may be unsuited to the highly gifted pupil in many ways. The 
monotonous drill and role memorization which constituted such a 
large part of school work in the days when men like Darwin or Hume 
attended school would prove particularly irksome to a bright child. 
Darwin, for instance, seems to have been more interested in his col- 
lections of insects than in niemorizing Latin declensions, much to the 
annoyance of his teachers. Thus it is often impossible to accept the 
recorded opinions of parents or L ichers regarding the intellectual 
status of great men in childhood. 

More accurate information can be obtained from factual records 
of the specific behavior of the individual at various ages. An early 
attempt to conduct such an analysis of ihc boyhood of great men 
was made by Yoder (72). Fifty cases, repiescnting a wide variety 
of occupations or fields of eminence were selected from the great 
men of six countries. All the subjects were born in the eighteenth or 
nineteenth centuries, except Newton, Swift, and Voltaire, who were 
born in the seventeenth. In general, Yoder found that ill health in 

^ Also proposed by Lombroso (38). 
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childhood was often exaggerated by the earlier biographers and that 
this condition was not so prevalent as is supposed. Feeble or delicate 
health may, however, olTcr advantages in some cases by stimulating 
reading and intellectual pursuits. Dickens was a good example of 
this. In regard to intellectual status, Yoder reports that excellent 
memory and vivid imagination were often exhibited by great men 
from early childhood. 

A very detailed and comprehensive study of the childhood of great 
men was conducted by Cox (13), as one part of the Genetic Studies 
of Genius under the general direction of Fcrmaii. The tcclmique em- 
ployed was Terman's adaptation of the historiometry method. Through 
the examination of several thousand biographical references, infor- 
mation was gathered on the traits of 301 eminent men and w^omcn 
born between 1450 and 1850. Particular attention was given to child- 
hood behavior, such as age of learning to read, letters and original 
compositions which may have been preserved, and early interests. 
Any special circumstances which might have influenced the subject’s 
development were also noted. The material so collected w^as analyzed 
and evaluated independently by three trained psychologists. Each 
investigator estimated the knvest 10 compatible with ihc^iven facts 
for every subject, and the average of these three independent judg- 
ments was taken as the final minimum IQ estimate for the given 
individual. 

After allowing for certain inaccuracies in the data, Cox concludes 
that the average 10 for the group “is not below 155 and probably at 
least as high as 165“' (13, p. 217). The estimated minimum IQ’s 
ranged approximately from 100 to 200. The same geniuses cited by 
Lombroso and others as instances of early mental inferiority were in- 
variably found to give evidence of high IQ’s during childhood. Among 
these may be mentioned Lord Byron, Sir Waller Scott, and Charles 
Darwin, whose estimated childhood IQ’s proved to be 150, 150, and 
135, respectively. Among those receiving IQ’s above 180 were 
Goethe, John Stuart Mill, Macaulay, Pascal, Leibnitz, and Grotius. 

Another interesting finding pertains to the average estimated 10 
of persons achieving eminence in dilTercnt fields (53). Philosophers 
topped the list with a mean JQ of 170; next came poets, dramatists, 
novelists, and statesmen with 160; scientists had a mean of 155, 
musicians 145, artists 140, and military leaders 125. This hierarchy 
probably reflects at least in part the close association of ‘Intelligence” 
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with verbal aptitude in our present standards of evaluation. Those 
groups with mean lO’s of 160 or over were engaged in activities in 
which written or spoken language played a predominent role. Far- 
thest from the verbal field in their area of accomplishment arc the 
persons at the bottom of the list: military leaders, artists, and 
musicians. 

In the same survey, 100 geniuses were selected for whom the rele- 
vant records were especially full, and ratings were assigned to each 
person on a number of specific intellectual, emotional, and character 
traits. These ratings, like the lO’s, were based upon the childhood 
behavior of the subjects, and the averages of two independent raters 
were used. As a group, the subjects proved to be unquestionably 
superior in all tJic traits rated, and were especially ouistanding in such 
characteristics as desire to excel in their eirv)rts, steadfastness of 
efioit, persistence in the face of obstacles, intellectual vvorf devoted 
to special pursuits, profoundness ol apprehension, and originality 
and creativeness. Another sub-group of 50 cases, similarly selected 
because of fullness of data, were rated in a like manner for physical 
and mental licalth in childliood. The distribution of the group in 
these respects is reported to be fairly normal and to show no greater 
per cent of unfavorable deviants tluin are found among imselected 
school children. 

Some of the inconsistencies and confusions regarding the asso- 
ciation of ‘'genius’’ and “insanity” may result from the common use 
of these blanket categories as though they represented single entities 
(41, 64). If we ask w/uit of genius and wluil kind of abnor- 
mality, we arc more likely to get a signilicanl and consistent answer. 
Rc-analyscs of the original Cox data, for example, have showm that 
the incidence of emotional abnormalities is greatest among the “aes- 
thetic type” (poets, novelists, artists, musicians) and the “reformer 
type” (revolutionary statesmen or radical religious leaders). It is 
least among scientists, soldiers, statesmen, and conservative religious 
leaders. The more “imaginative” genius is likely to show more psycho- 
pathic characteristics than the eminent “man of action.” As for spe- 
citic types of abnormality, analyses ci the same group suggest that 
introversion, emotional excitability, and fanatical self-confidence are 
the most frequent. Considering how often these geniuses were right 
in their novel ideas, the last-mentioned symptom seems to be more 
indicative of fanaticism in the rest of mankind than in the genius! 
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Other investigators have corroborated these findings regarding the 
specificity of the “genius personality.” One survey (47) compared 
120 men of science with 123 men of letters, both groups having lived 
during the nineteenth century. The literary group was limited to poets, 
novelists, and dramatists; the scientists included only workers in the 
biological and physical sciences and in mathematics. One interesting 
dilTercncc was found in the socio-economic backgrounds of the two 
groups, rhough both the scientists and the literary men came chiefly 
from the professional class, the two groups diflercd in that the scien- 
tists were much more likely than the men of letters to come from 
the farmer and artisan class. For the men of letters, the socio- 
economic class which ranked second in frequency to the professional 
was the semi-professionaL On the other hand, actual poverty was 
more often reported for literary than for scientific men. The scientists 
as a group were described as more cheerful, modest, and sociable. 
The literary men excelled in persistence, but were also more emo- 
tional, gave more evidence of neurosis, and had a slightly poorer 
health record both in childhood and adulthood. Also relevant is a 
recent survey (50 ) of the characteristics of “research workers,” con- 
ducted by a similar method. Biographical material was examined for 
250 research workers ranging from Euclid and Pythagoriis to con- 
temporary living scientists. Among the characteristics found most 
frequently were creativcncss, enthusiasm, and aggressiveness; least 
frequent were religiousness, siSif-control, and good health.*'' 

The results of all these studies have to be accepted with cautVm 
because of possible weaknesses in the procedures. Much depends 
upon the representativeness of the samples, the fullness of the avail- 
able data, and the objectivity and accuracy with which the recorded 
behavior items arc evaluated by the investigator. When great men of 
the past arc considered, a certain amount of historical perspective 
is also required, in order to judge the individual against his own cul- 
tural setting. On the whole, however, such studies do show that the 
men and women who achieved eminence tended to come from favor- 
able environments, gave early indication of superior ability, and were 
not as a group appreciably more unstable than the less gifted. At rhe 
same time, it should be clear that “the genius” is not one but many 
kinds of person. 


Cf. also Kossman’s survey of the dominant characteristics of inventors, as re- 
ported by the inventors themselves (49). 
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THE GIFTED CHILD 

The “Child Prodigy.” Since geniuses have generally displayed superior 
talents in childhood, a direct study of gifted children should prove 
fruitful in an analysis of genius. The traditional or popular concept of 
the “child prodigy” is that of a weak, sickly, unsocial, and narrowly 
specialized individual. His achievements are expected to be of the 
nature of intellectual “stunts” and to have little or no practical value. 

One of the earliest recorded cases of such a child prodigy is that 
of Christian Heinrich Hcinckcn, whose achievements are described 
by his teacher in an old German book published in 1779 (cf. 29, 46). 
At the age of 10 months this child was able to name objects in pic- 
tures; before 12 months he had memorized many stories in the book 
of Moses; and at 14 months he knew the stories of the Old and New 
Testaments, At 4 years of age he could read in his native language, 
had memorized 1500 sayings in Latin, and also knew Frcncli. At this 
time he was able to perform the four fundamental arithmetic opera- 
tions, and he knew the most important facts of geography. His fame 
spread throughout Luropc and he was summoned to appear before 
the King of Denmark. True to the traditional picture, however, Chris- 
tian Heinrich was a sickly child, and at the age of 4 ycars-4 months 
he died. 

Contrary to popular belief, the case of Christian Heinrich is not 
at all typical. As an example of a highly gifted child who developed 
into a healthy and successful adult wc may consider the case of Karl 
Witte (cf. 66). Born in I ochau, Prussia, in 1800, this “child prod- 
igy” lived until he was 83, having retained his excellent intellectual 
powers to tlie end. Karl was literally educated from the cradle. His 
father wjis convinced of the cOicac) of early training and undertook 
to prove this with his s^m. Vhe cluiJ was never taught “baby talk.” 
All the games he played were games of knowledge. When only 8 
years old, he read with apparent pleasure the original texts of Homer, 
Plutarch, Virgil, Cicero, Fcnelou, Florian, Metaslaslo, and Schiller. 
He matriculated as a regular student at Leipzig at the age of 9. Be- 
fore his fourteenth birthday he was gi ’.ted a Ph.D. degree. Two years 
later he was made a Doctor of Laws, being at the same time ap- 
pointed to the teaching stair of the University of Berlin. 

Karl Witte’s father, in discussing the boy's education, wrote: 

... he was first of all to be a strong, active, and happy young man, 
and in this, as everybody knows, 1 have succeeded. ... It would have 
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been in the highest degree unpleasant for me to have made of him pre- 
eminently a Latin or a Greek scholar or a mathematician. For this reason, 
1 immediately interfered whenever I thought that this or that language or 
science attracted his attention at too early a time (66, pp. 63-64). 

Karl seems not to have been in the least vain or spoiled. He never 
paraded his knowledge, was modest and unpretentious, and not in- 
frequently tried to learn from his companions what they knew better 
than he. He had many playmates of his own age and we arc told 
that ‘'he got along so well with them that they invariably became 
very fond of him and nearly always parted from him with tears in 
their eyes"' (66, p. 187). 

Contemporary case studies of gifted children by psychologists like- 
wise lend no' support to the view that such children are necessarily 
inferior in other respects. Jn 1942, L. S. Hollingworth hroaight to- 
gether in one book (29) 31 case reports of children wjiose lO's were 
over 180. Such lO’s should occur about once in over a million cases. 
The accomplishiiKnts and adjustment of children in these IQ levels 
are illustrated by tlic following cases. 

A gifted juvenile author, Elizabeth , obtained a Stanford- 

Binet 10 of 188 when tested at the age of 7 ycars-10 mor\fhs (cf. 54; 
55; 29, pp, 35- 37). She ranked high Jn all other intellectual and 
educational tests, but showed a special interest and talent for the com- 
position of prose and poetry. This child was reported to be in excel- 
lent health and free from physical defects; she was a year or so accel- 
erated in physical development. Elizabeth’s superior linguistic abilities 
were apparent from an early age. At 19 months she could express 
herself clearly and also knew the alphabet. By her eighth birthday 
she had read approximately 700 books, including such authors as 
Burns, Shakespeare, Longfellow, Wordsworth, wScott, and Poe. By 
this age she had also written over 100 poems and 75 stories. The 
followijig is a specimen of her literary products, written at the age of 
7 years- 1 I months and entitled "Fairy Dcllnition”: 

Fairies are the fancies of an imaginative brain 
Which wearying of earthly realities aspires to 
Create beings living only in thought 
Endowing the spirits thus created 
With all genius for giving Happiness. 

A case which attracted wide attention in the 1920's is that of a boy 
known in the psychological literature as E (29, pp. 134-158). 
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When first tested at the age of 8 years- 1 1 months, E obtained a 

mental age of 15-7, which gave him an IQ of 187. He also did well 
on all other tests except those involving manual dexterity. He is re- 
ported as being strong and healthy, but not much inclined to indulge 
in games and sports. At the age of 12 he was admitted to Columbia 
College. On the Thorndike Intelligence Examination for High School 
Graduates, he ranked second among 483 competitors. During his 
freshman year at college all his academic grades were B or better, 
with the exception of ph>sical education, in which his grade was C. 
He is described as being a “good sport’' and getting along well with 
the other students. He received his A.B. degree at 15, being also 
elected to Phi Beta Kappa. At 16 he obtained his M.A. degree, and 
by 18 had completed practically all requirements toward the Ph.D. 
degree except the dissertation. On the CAVD Intelligence Exami- 
nation, his score was 441, which falls approximately in die upper 
14 of 1% of college graduates. Thus, during the period over which 

he was investigated, E showed no tendency to drop below the 

high intellectual level indicated by his initial IQ. 

These cases arc typical examples of intellectually superior chil- 
dren. Exceptional talents in childhood are not incompatible with good 
health, pliysical vigor, longevity, or a well* rounded personality. To 
be sure, puny, timid, and sickly children can be found among the 
gifted, as among the intellectually normal or dull. But such cases arc 
very few and cannot be rc^garded as representative of the group as 
a whole. 

dhc highly gifted may, f course, have their own special adjust- 
ment problems, especially during childhood and adolescence, by vir- 
tue of their exceptional intellectual status. But such maladjustments 
arc an indirect result of high inicl'cct, rather than a cause or an 
intrinsic component of genius. Among the possible problems encoun- 
tered by the child whose IQ is much above 150 (28, 29) are those 
arising from the fact that he is younger and hence smaller and \vcaket 
than his classmates. This condition may m:4e him more susceptible 
to bullying and may interfere with h!. participation in athletics and 
active games. A second source of duhculty is the "dsoiation" from 
contemporaries and from the common activities of others which is 
likely to result when the individual’s interests and abilities are so 
unlike those of his fellows. Ncf^ativisni toward authority may develop 
when the child r.ailizcs that authority is often irrational or erroneous 
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in its operation. Intolerance and unwillingness to “suffer fools gladly” 
may follow observations of .relatively inept tliinking on tlie part of 
associates. The superior child may also develop habits of inefficient 
work and laziness because ordinary school work offers no challenge 
to him. Such work habits may carry over into later educational and 
even vocational activities. 

For these reasons, L. S. Hollingworth (28, 29) concludes that the 
optimum IQ from the viewpoint of personal adjustment, leadership, 
and acceptance by one’s fellows — with the “accompanying emolu- 
ments and privileges” which such acccpt:mce entails — fails between 
130 and 150. To be sure, the adjustment difficulties of the highly 
gifted child are of the sort that can be prevented by proper under- 
standing and a suitable educational environment (52). During the 
past two or three decades the special education of the gifted child 
has made rapid strides,^ a progress to which L. S. Hollingworth her- 
self made some of the most outstanding contributions (25, 26, 29). 
The outlook for even the most highly gifted “prodigy” need not, there- 
fore, be a pessimistic one. 

Intelligence Test Survey.s. The testmg of large groups of intel- 
lectually superior children has revealed the continuity which exists 
between the average child and the highly gifted “prodigy^’ In order 
to include a sufficiently large number of cases in such studies, the 
standard of selection must be lowered, liut by surveying a wider range 
of superior intellect a more complete picture will be obtained. Since 
the rise of the mental testing movement, a number of studies on 
moderately large gioups of superior children have appeared (cf. 40). 
The most extensive project of this sort is that begun in 1921 by 
Tcrman and his associates, and reported in llie Genetic Studies of 
Genius (cf, 55, 6, 58). Because of the more comprehensive nature 
of this study and its essential agreement with the findings of other 
investigations, it will be described in greater detail. 

The total group employed in Icrman’s study (cf. 58, Ch. I) in- 
cluded 1528 California children, ranging in 10 from 135 to 200 and 
in age from 3 to 19. These children represent approximately the 
upper 1% of the school population. Of these, 661 elementary school 

^ As rally as l'^r4. The National Society for the Study of rdiication devoted one 
of its YcattH)oks cntiitly to teaching methods suitable lor gifted children. For a 
survey of nioie recent deve)oj>incnts in this held, cf 40, 43, 60, 69, H). Attention is 
also called to the iccently formed American Association for Gifted Children, which 
is specially concerned with the problems of the gifted child (65). 



Genius 


599 


children constituted the “main experimental group,” on which the 
major findings of the initial test survey were based. This group was 
compared, in an extensive scries of tests and measures, with control 
groups composed of random samplings of school children. For reasons 
of expediency, different control groups A\erc employed for various 
comparisons, the number of cases in such groups ranging from about 
600 to 800. 

The socio-economic level of the gifted group was decidedly su- 
perior. Among the fathers of the gifted children, 3 1..4% belonged to 
the professional class, 50% to the semi-professional or higher busi- 
ness class, 11.8% to the skilled labor class, and 6.8% to the semi- 
skilled or unskilled labor class. The average sehool grade reached 
by tlie parents of the gifted group was 1 1 . 8 , and by the grandparents 
10.0. Tn comparison to the average person of their generation in 
the United Slates, the parents in this group had received from 4 to 
5 grades more schooling. Moreover, a third of the fathcis and 15.5% 
of the mothers had graduated from college. The number of eminent 
relatives and ancestors was also far in excess of that which would 
be expected by chance, and many of the families had highly distin- 
guished gcncalogiics. 

The homes of the gifted children were visited by field workers, and 
were rated from 0 to 6 on necessities, neatness. siz»;, parental con- 
ditions, and parental supervision." TIic average rating of the homes 
was over 4.5 in each of these five categories, and only 10% of the 
homes received a total rating which was distinctly poor. Neighbor- 
hood ratings and income I vcl »vcrc also considerably better than the 
generality for California. 

We may next consider certain ^ :fal statistics as well as medical 
and physical data obtained on the gifted clnldren themselves. The 
frequency of insanity hi the family Was lower than average. Only 
0.4% of the parents and 0.3% of the grandparents and great-grand- 
parents had a record of insanity. As in the studies on adult genius, 
the gifted group contained a greater proper' ion of first-born children 
than the general population. The gifa d children developed at a more 
rapid rate than the normal from eany infancy. I’hey walked on the 
average one month earlier and talked 3V2 months earlier than the 
control groups. The onset of puberty was also somewhat earlier than 
normal. Physicians’ examinations showed superior health and relative 

■^The Whittier Seale for Grading Home Conditions was used for this purpose. 
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freedom from defects in the group as a whole. Similarly, such con- 
ditions as “nervousness,” stuttering, headaches, general weakness, 
and poor nutrition were less common in the gifted than in the control 
groups. In height and weight, physical and muscular development, 
and strength, the overlapping of gifted and control groups was almost 
complete. Such differences as did occur, however, favored the gifted 
group. 

The educational accomplishments of the gifted group were, of 
course, far in advance of the normal.^ About 85% of the gifted chil- 
dren were accelerated and none was retarded. The administration of 
standardized achievement tests in school subjects revealed that the 
majority of these children had already mastered the subject matter 
from one to three grades above that in which they were located, 
riuis with reference to his actual abilities, the gifted child is often 
retarded rather than accelerated in school-grade location. The gifted 
children as a group tended to excel in all school subjects; one- 
sidedness was not characteristic of these children. Their superiority 
was greatest, however, in such subjects as language usage, reading, 
and other “abstract" work, and least in shop training, sewing, cook- 
ing, and similar "craft" subjects. 

The gifted group displayed a wide range of interests outside of 
their school work, as well as an active play life. A two-month read- 
ing record kept by the children showed that the gifted read more than 
the control at all ages. At T, the number of books read by the gifted 
group was three times that of the control, dhe range of topics cov- 
ered was also wider and tlie cjuality of the books superior in tlie 
gifted group. Similarly, the gifted children w'crc more enthusiastic, 
had more intense interests in general, and reported more hobbies than 
the control group. Collections were nearly tw'icc as common among 
the gilted as among the control, and tended also to be larger and 
more often of a scientific nature. A c]uestionnaire on play information 
showed that the typical gifted child of 10 knew more about playing 
and games than the average child of 13. Apart from the fact that 
the play interests of the gifted children were more mature than those 
of the control children of their ow'n age, no conspicuous dilTcrences 
were found in their play activities. 

In character and personality development, the gifted children were 

^ This ^vas pailly the result of the method of selection. Teachers wcie asked to 
name the brightest children as well as the youngest child in each class, and Irom 
among these the gifted subjects were chosen by intelligence tests. 



Genius 


601 


also found lo be in advance of the normal. This was confirmed both 
by scores on objective tests of emotional adjustment and character 
traits and by parents’ and teachers’ ratings. On a specially devised 
battery of se'^cn objective personality tests, the differences in favor 
of the gifted group were large and significant in every test.-' From about 
60% to 80% of the gifted group equaled or excelled the average of 
the control group iii each of these tests. 

The findings of the California study have been closely corroborated 
by studies on similar groups in the Middle West by Witty (67, 68), 
in New York by L. S. Idollingworth (24. 27, 20), and in England 
by Duff (16). Superior home and parental background, better-than- 
a^'crage health and physique, outstanding educational achievement, 
and greater emotional maturity and stability were characteristic of 
all these gifted gnnips. 

THE GH TED CtHI.D GROWS UP 

Among tlic many superstitions entertained in regard to geniuses and 
child prodigies is that which claims that the gifted child deteriorates 
as maturity is approached and that his ultimate mental level will be 
average or even inferior. Prolonged case studies on a few individuals, 
as well as a number of scattered investigations on groups of gifted 
children, have quite conclusively disproved this view. 

The most extensive follow-up of a large group of gifted children 
is that conducted under the direction of Fcrman, and reported in 
Volumes 111 and IV of tlw Genetic Studies oj Genius (6, 58). An 
integral part of the plan of the California study included periodic 
follow-ups of the original group of gi'^Vd children. The first follow-up, 
after six years, involved the retesting of small samples of the original 
subjects, as well as a detailed progress report on a larger proportion 
of the group. At this time, most of the subjects were in their adoles- 
cent years. T he high school records of the group were fully as dis- 
tinguished as their performance in elementary school. Achievement 
tests, as well as intelligence tests, s’a-wed continued superiority, as 
did also general health and personal atid social adjustment. Participa- 
tion in extracurricular activities and leadership among classmates 
were especially outstanding for the group. 

'♦Ciitual latios { tiiff. 'a.ii ) iani:ed fiom 3.87 to 14.41. Ciitical ratios of 3 or 
more indicate that the chances of .i tiue diflfcreiKe aie o^e^ 99 TOO. 
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Subsequent follow-ups were conducted in 1936, 1940, and 1945. 
The 1936 follow-up was a preliminary questionnaire survey, whose 
findings were superseded and rendered obsolete by the two later 
follow-ups. The 1940 follow-up was a thorough and comprehensive 
one, involving an extensive testing and interviewing program by field 
investigators. At this time, the average age of the group was 30 years. 
Of the original total of 1528 children, 61 were deceased in 1940 
and 33 could not be traced. The remaining 1434 cases participated 
in the intensive survey. In 1945, a supplementary follow-up was con- 
ducted by mail. By this time the majority of the group were 35 years 
old, an age at which adult careers are clearly taking shape. The 
results of the 1940 and 1945 follow-ups, taken together, constitute 
the basis for the analysis of the adult status of the gifted group, re- 
ported by Terman and Oden in The Gifted Child Grows (58). 
Plans are under way for the further continuation of this extensive 
longitudinal study. 

Adult intellectual status was measured by a specially constructed 
Concept Mastery Test consisting of opposites and analogies and cov- 
ering many fields of information. Through this test it was possible 
to estimate that the average adult 10 of the gifted group was about 
134, representing a drop of 17 points from their childhood average 
of 151. The authors show that such a drop is no greater than would 
be expected from regression (cf. Ch. 8). Such regression, however, 
w'ould result not only from- errors of measurement in the Stanford- 
Binct and the Cc)ncept Mastery Test, but also from differences in the 
functions measured by the two tests, as well as from actual behavior 
changes in the subjects resulting from maturation or learning. In 
other words, predictions over a twenty-five-year period are subject 
to considerable error, not only because of the unreliability of the 
tests, but also because much can happen to change the subjects dur- 
ing such a period. The important point, however, is that the obtained 
change in this group is not significantly different fiom that expected 
by chance and gives no evidence of any special decline of ability. 

Educationally, the gifted group excelled in all comparisons. They 
attended college in much larger numbers, took graduate degrees much 
more often, and received better grades and many more academic 

^^This is the follow-up whose lesults were reported by Termrn and Oden in 
1940 (56, 57 ). 

This book is Volume TV of Genetic Studies of Genius. 
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honors than any other groups with which comparisons were made. 
Among the men, 69.5% completed college, and among the women 
66.8%. The per cent receiving Ph.D. degrees was over five times 
as large for the men and over eight times as large for the women 
in the gifted group as in a representative sampling of college grad- 
uates. A special study of educational acceleration in the gifted group 
not only showed acceleration to have been common, but also lent no 
support to the view that such acceleration may be detrimental. Any 
slight social handicap suffered by the very accelerated subjects during 
adolescence seems to have been fully overcome in later years. Tn 
fact, whatever differences were found in later achievement or adjust- 
ment tended to favor the accelerated group. 

TABLE 34 OccupaliouaJ Cldssificalion of GiUcd Men and of All 
Employed Men in ^'\ilifornia (I9d0) 


( I'l'ini ,niil 0 ('mi, p 17!' 
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1 . 
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8.1 

III. 
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24.3 
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1,2 

12 4 

V. 
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6 2 

3J 6 

VI. 

.Shghttv skilled trades 

0 7) 

17.8 

Vii. 

Day iaboicfs: Uibu'- and rural 

0.0| 


In occupational level, the gifted g 'up stood far above the average, 
being represented in the higlrcr prokssions by eight times its propor- 
tional share. In Table 34 will be iound the oeeiipational distribution 
of the gifted men. togetlier with the corresponding distribution of all 
employed males in the ld40 California census. Nearly half of the 
gifted men are in the professumal category, as contrasted to less than 
6% of the generality; the corrcspr.;'''i'ig proportions in the semi- 
professional and higher business category are 25.7% and 8.1%, re- 
spectively. On the other hand, only 6.2% of the gifted men are in 
semi-skilled trades, as against 31.6% of the generality. Similarly, less 
than one per cent of the gifted group are in the slightly skilled trades 
and none in the unskilled, as contrasted to 17.8% of the generalit/' 
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in these two classes combined. Even in comparison with groups of 
male college graduates, the gifted group excels markedly in occupa- 
tional status. 

The occupational history of the gifted women is much more dif- 
ficult to interpret, since jobs and careers have a very different signifi- 
cance for the two sexes in our culture. Of the entire group of gifted 
women, 42% were housewives and not gainfully employed. Only 48% 
reported full-time employment at the time of the follow-up. Among 
those employed, the largest number (30.8%j were in secretarial 
or other office work, and the second largest (21.1%) in elemen- 
tary or high school teaching. Social work, arts, writing, and college 
teaching and research each claimed from 5% to 7%. Perhaps the 
most outstanding finding in the comparison of the gifted women with 
other groups of women college graduates is the smaller proportion 
of the gifted who chose teaching and the larger proportion who 
chose office work. The interpretation of any of these results would 
be hazardous, in view of the multiplicity of factors which influence 
the occupational activities of women in our society today. The dis- 
cussion of sex differences in the next two chapters may help to clarify 
some of these results. 

The mortality rate of the gifted group was hclow' that of the gen- 
erality, and both physical and mental health remained superior. The 
incidence of delinquency, alcoholism, and serious maladjustments 
was less than in the general populatiim, and there was considerable 
evidence of good emotional and social development and breadth of 
interests. Participation in cxtnicurrieular activities was as conspicu- 
ous in college as in high school. Hobbies and avocational interests 
were well developed and closely resembled those of any contemporary 
American group. An active interest in political and social matters is 
suggested by the fact that 91% of this group reported that they voted 
in all natkmal elections, in contrast to only about 70% in the general 
Califcnnia population, ddie social and political attitudes of the gifted 
group showed no marked deviation from the generality. The subjects’ 
war records, in both military and civilian capacities, were also found 
to be quite creditable and distinguished. 

Of considerable interest arc the data on marital status and marital 
adjustment. The incidence of marriage among both the gifted men 
and the gifted women is above that of college graduates of the same 
age, and is about equal to that in the general population. Intelligence 
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tests of the spouses as well as the offspring showed both to be quite 
superior, but below the average of the gifted group itself. On specially 
designed tests of '‘marital aptitude” and “marital happiness,” the 
present group was somewhat superior to other groups less highly 
selected in intelligence. Sexual adjustment was in all respects as 
normal as in less gifted groups. Divorce rate was no higher than in 
the generality of comparable age. 

A special study of individuals whose initial IQ’s had been 170 or 
higher showed them to compare favorably with the rest of the gifted 
group. Ihcy were more often accelerated in school, received better 
grades, and continued their education longer than the average of the 
entire group. They were as well adjusted emotionally and more suc- 
cessful vocationally than the rest of the group. Thus it seems that this 
particular group of exceptionally gifted children were, on the w'hole, 
able to overcome the special problems and diilieulties wiiich their 
high intellectual level might engender. 

Probably one of the most interesting analyses in the entire survey 
is the comparison of the 150 men rated “most successful’' (Group A) 
with the 150 rated “least successlul” (Group C) in adult achieve- 
ment. Despite the high average accomplishments of the entire gifted 
group, the adult achievement of individual members ranged from 
‘’international eminence to unskilled labor” (58, p. 311). In the 
cflort to clarify some of the correlalcs of adult achievement, the two 
contrasted groups A and C w'crc compared cm about 201) items of 
information which had been secured between 1921 and 1941. The 
most conspicuous dilTcrcnc. were the supeiior educational and voca- 
tional level of the parents of the “A’' men, as well as the greater 
“drive to achievement” on the part of the ‘^A” men. Tor example, 
over 50% of this group had lalhcn who were college graduates, in 
contrast to 15.5% of the “C” men. More than twice as many fathers 
of the A’s were in the professions. As for the subjects themselves, 
both self-ratings and ratings by family and associates showod the 
largest A-C dilfcrenccs in ‘integration toward goals,” “perseverance,” 
and ‘’self-conridcncc.” Significant dih.'ivnces in favor of the “A” men 
were found in school acceleration, me A group graduating from 
elementary, high school, and college at younger ages. Initial IQ's 
also averaged significantly higher for the A group; but this difference 
was not large, the two averages being 155 and 150. In summary, fac- 
tors related to home background seemed to play a major role in the 
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adult achievement of these men, all of whom were within the upper 
levels of intelligence. Among such men, motivational factors — them- 
selves probably traceable to environmental conditions — often made 
the difference between outstanding achievement and mediocrity. 

From an over-all view of such follow-up investigations on gifted 
children, what can we conclude? At the outset, it should be noted 
that some corroboration of the California findings, although on a 
much smaller scale, is provided by follow-ups of the New York 
(30, 31, 39) and midwestern (68) groups cited previously. These 
studies, too, indicate that the gifted child, on the whole, grows up to 
be an intellectually superior and fairly well-adjusted adult. 

There is a possibility that the California results may be unduly 
optimistic about the emotional adjustment of children in the highest 
10 levels. Perhaps the mcthoci of selecting the original group may be 
partly responsible for such a finding. The major group was chosen 
on the basis of teachers’ recommendations, the children thus recom- 
mended being then given intelligence tests for the final selection. As 
a check on this procedure, the entire population of three schools was 
tested, following the teacher nominations. The results showed that 
about 90% of all the children who qualified for the study on the 
basis of test scores W'ould have been reached by the usiiaT procedure. 
It is possible that the 10% who were thus lost to the study may have 
included a disproportionately large number of scholastically and emo- 
tionally maladjusted cases. ‘‘Tdicir exclusion from the study by the 
method of search might thus lead to an unduly optimistic picture in 
these two respects.^- The possible effects of participation in the study 
upon the subjects’ subsequent development should also be considered. 
No control subjects were follow'ed up along witfi the gifted subjects. 
Not only the knowledge that one is a ‘‘gifted child,” but the personal 
interest in each subject which was apparently shown by the field in- 
vcstigatc^rs and project directors cannot be completely discounted. 
The experimental design employed in the study includes no control 
for this factor. 

In reference to the question of what constitutes “genius” and how 

^''^Corroborative evidence is ptovided by a recent test survey of over 45,000 
children in giadcs 4 to 8 (56). Without knowledge of lest icsults, leaclters were 
asked to indicate each child whom they considered ‘'a distinct problem.” “extremely 
mentally retarded,’* and “a genius.” The influence of the child’s class' oom adjustment 
and academic interests and achievement was evident m the clioices for the *‘gcniiis” 
category. 
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the eminent adult is related to the gifted child, several points may be 
noted. First, it has been pointed out by L. S. Hollingworth (27) that 
an IQ of 135 or 140 is certainly far below the “genius level.” Such 
IQ’s fall within the upper quarter of college students. In fact, in some 
of the better colleges, the mean IQ is close to 150. On the basis of 
her own follow-ups of groups of gifted children, Hollingworth pro- 
posed that an IQ of IHO or higher is more nearly at the ""gcaius 
level,” equipping the individual for academic and professional dis- 
tinction, original and creative work, the winning of prizes, and other 
evidences of eminence. 

Any definition of genius, however, is so intimately related to the 
specific cultural setting that to consider the individual apart from the 
time and place in which he lived and worked is highly artificial. It is 
quite generally agreed, morciner, that a high IQ alone is no guar- 
antee of “genius.” Some writers (2, 7) have particularly ernpliasized 
the role of special aptitudes, such as talent in art, music, or mechanics, 
in their definition of genius. The importance of motivational and 
emotional factors, stamina, environmental background, and oppor- 
tunity has also been repeatedly stressed. Many have been impressed 
with the quality that so often makes the genius undertake and persist 
in what others have labeled “impossible.” Jt was this point that Bolitho 
eloquently expressed when he wrote: “Where common sense is hor- 
rified, where the sign Impossible’ is raised in warning, kindness or 
spiteful jov, there is your exit, prisoner: there is the door of ad- 
venture,” 
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Sex Differences?: 
13asic Problems 


Specialization oe vocational A( iivutls vviili regard lo the sexes 
has been a powerful social tradition in almost all cultural groups. 
I'hc particular tasks assigned to each sex vary from group to group 
and arc even occasionally leverscd, but some dilTcrcntiation of activ- 
ity is practically universal. These distinctions are impressed upon the 
individual from early childhood, either by actual overt diircrences in 
training and play activities, oi by the moic subtle but perhaps more 
clTectivc inculcation of traditional beliefs and idci\ls. It is apparent 
that in most societies the eircctuai environments of the two sexes arc 
fundamentahy diverse from an early age. Under such coTiditions, we 
should expect pronounced variation in the emotional and intellectual 
development of the two sexes. By a curious circular argument, how^- 
cver, these socially conditioned behavioral difl’erenees are often attrib- 
uted directly to innate factors. 

The belicl in hereditary sex dilTercnces in intellectual and emotional 
traits is an old and persistent one. it is only since the development 
of objective and quantitative testing methods that the notion of 
‘'female inferiority" has been dispelled among scientists. In the gen- 
eral public, this belief still prevails, as is manifested by the reluctance 
to open certain educational and professional opportunities to women 
and by the frequent discrimination against individuals on the basis 
of sex alone. The reasoning underlying such practices is that it would 
be futile to provide identical training for men and women, since the 
existing dilTercnces in their behavior are so clearly apparent. This 
view, of course, fails to consider the possibility that the existing sex 
diflercnees may themselves be the result of the diverse training and 
environment of the two sexes. 

The objective study of sex differences in intellectual or personality 
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traits began shortly after the rise of the mental testing movement. In 
1910 Woolley, one of the first investigators in this field, listed less 
than a dozen psychological studies on sex dilTcrcnces (52). A review 
appearing in 1926 contained over 200 such lilies (26). In 1935, 
one bibliography included more than 300 studies (33). Today, the 
relevant investigations number well over a thousand and the entire 
field is rarely covered in a single survey. Hvery type of function has 
been surveyed, from sciisori-motor processes, through simple per- 
ceptual and associative tasks, to more complex intellectual activities 
and personality characteristics. Almost all tests, shortly after their 
construction, have been administered to men and women, and their 
scores compared. It was a relatively easy task to gather such data, 
especially after tire advent of standardized group tests; but it was 
quite a different matter to determine what these data meant. 

In common with other group comparisons, the study of sex dif- 
ferences in behavior presents a number of methodological dilficiilties. 
An understanding of these pi'oblcnis is essential for the proper inter- 
pretation of the lindmgs of any reported stLidy. Foi this reason, we 
shall begin by considering the basic questions which must he raised in 
the evaluation of any data on group difieicnces. 

r.VALUAIlON or GROUP DlPrLKl:N(.ES 

Selective Factors. In all group comparisons, selective factors may 
operate to vitiate the results. When a group is not a random or 
representative sample of ire population from which it is drawn, it 
is said to be a select group. Such a sampling is imsuited for any type 
of investigation, since any results obtained with it could not be gen- 
eralized but would apply only to the speeitie gremp (miploycd. An 
additional complication in the comparison of two populations arises 
from the fact that selection may have operated dilTerently in the two 
groups. Thus if a group o\ college girls w'crc compared with trade 
school boys, the two samplings would be ^elected in different w'ays. 
Not only is neitiier group representative of men or women in general, 
but the one lepicsents the upper end of the female distribution 
and the other a central or slightly inferior segment of the male dis- 
tribution with respect tc> education and correlated variables. In addi- 
tion to being unrepresentative, these groups arc not comparable. 

Selective factors are often difiicult to delect and usually difficult to 
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control. An example of such a selective factor is provided by the 
comparison of high school boys and girls. Offhand, we might say that 
groups of boys and girls attending the same high school constitute 
truly comparable samples for the study of sex differences. But investi- 
gations on elementary and high school students have demonstrated 
that this is not the ca^e. 

Let us examine, for example, two independent studies in which the 
Pressey Group Test of Intelligence was administered to 2544 elemen- 
tary^ school children between the ages of 8 and 16 (37) and to 5929 
high school seniors ranging in age from 16 to 23 (5). The percent- 
ages of boys who reached or exceeded the median score of the girls, 
as well as the number of cases in each group, are shown in Table 35. 
In the elementary school study, the data arc reported separately for 
each age group. In the study on high school seniors, a single summary 


TABLE 35 Sex Differences in Intelligence Test Scores of L lenientary 
School and High School Croups 

37 , 1 ) ;>iul aiwl INrcadows, \ p (>1) 


JAcificntarv ScJmol Gump 

T^nmln 

of 

Pcf ('cut of H(>v\ Rcochifi^ 

As-^c 

Boys 

(ll! Is 

or Lxcecdiu^ Gnls Median 

8 

57 

92 

40 

9 

^ 132 

154 

34 

10 

176 

177 

42 

11 

179 

167 

41 

12 

182 

180 

44 

n 

174 

174 

39 

14 

138 

162 

43 

15 

102 

139 

41 

16 

62 

97 

49 

High School Seniors 

2422 

3503 

56.2 


figure is given for the entire group. It will he noted that in the elemen- 
tary school grades the girls excel at all ages, although the sex difference 
is negligible among the 1 6-ycar-olds. Among the high school seniors, 
however, this relationship is reversed, over 50% of the boys reaching 
or exceeding the girls’ median score. 

This reversal becomes intelligible if we examine the rel itive number 
of each sex in the elementary grades and in the senior year of high 
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school. Throughout the high school period there is a much more rapid 
elimination of boys than girls. Boys whose academic work is not satis- 
factory arc more likely to leave school and go to work, whereas girls 
tend to be kept in school longer. Girls also seem to adjust better to 
the school curriculum and school routine in general. The less intelli- 
gent girls will exert more cllort and manage to pass sufficient subjects 
to stay in school, while boys in the same situation are more likely to 
rebel against school work. This explanation was borne out by an exam- 
ination of the academic history of those students who had dropped out 
of high school. Owing to tlie differential action of this selective influ- 
ence upon the two sexes, diffeiences between the intelligence test 
scores of high school bo\s and girls cannot be regarded as true sex 
(Jiffcrcnces. In the evaluation of any study on group differences, selec- 
tive factors arc one of the most subtle forms of error to which we 
must be constantly alerted. 

Significance of a Difference. ('Inc of the first questions which the 
psychologist asks regarding any reported group difference concerns 
the statistical significance of the difference. The problem of signifi- 
cance arises fiimi the fact that in any investigation only a sample of 
the entire popahitiofi is employed. For example, if the population 
under investigation is defined as public school childieii in American 
cities, data may be gathered on some 5000 or 6000 children in a dozen 
schools. From these results, the investigator generalizes to the entire 
population. It the sampling was cai dully chosen to be representative 
of the given population, such conclusions wall not be far in error. I he 
figures thus obtained, how'^ver will not be identical to those which 
would have been secured by testing the entire population of American 
city public school ehildren. Nor will the results from successive sam- 
plings of the population coincide piifectly. Had another sampling of 
5000 city public school children been employed, slightly different 
results would have been obtained. 

1 his variation in lesults from sampling to sampling within the same 
population is known as a samplim: error. Statistical measures of rclkh 
bility provide a theoretical estimate of the probable limits within 
which such errors will fall. Formulas available for the computation 
of the sampling error of all statistical measures, such as averages, dif- 
ierenccs between averages, measures of variability, and correlation 
coefficients. It is thus possible to estimate the maximum amount of 
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variation to be expected in any statistical measure if the experiment 
were repeated on another sampling of the same population. 

When we ask, “Does this mechanical aptitude test show a signifi- 
cant difference in favor of the boys?” we mean simply this: “Would 
the boys’ average score still be higher than that of the girls if wc were 
to test the entire population of boys and girls from which our samples 
were drawn?” We rt^fer to the diflercnce we actually found as the 
“obtained” difference, and to the difference in the entire population in 
which we are interested as the “true” difference. 

By simple formulas,'^ we can compute the standard error of 
the difference (au.n.) and with it, the critical ratio or “t.” The latter 
is simply the ratio of the obtained difference to its standard error 
(t - diff. rpiiji ). It has been customary for many years to regard a 
critical ratio of 3 or higher as evidence that the obtained difference is 
significant. In other words, when the obtained dilTcrence in favor of, 
for example, the boys is 3 or more times as large as its standard error, 
we can be virtually certain that there is a “true” difference in favor 
of the boys in the entire population. 

jMorc recently, there has been a tendency for statistical workers to 
express the signihcance of a difference more precisely in jerms of the 
actual prohahility of a true difference in favor of one or the other 
group. With a t of 3, the probability that the obtained difference indi- 
cates a true difference is about 99.7 out of 100. For the probability 
to be exactly 99/100, the critical ratio would have to be 2.58. In such 
a case, the chances that the population dilference favors the same 
group which excelled in our tested sampling are 99 out of 100, and 
the probability that the difference is either absent or in fasH)r of the 
other group is only 1 out of 100. This is the basis lor the frequently 
encountered statement that the dilference is “significant at the .01 level 
of confidence.” Another way of expressing the same conclusion is to 
state that the probability that the obtained dilference resulted from 
chance factors alone and does not indicate a true group difference is 
.01 or less (P < .01 

A hypothetical example will serve to illustrate the use of such 
tests of significance. Let us suppose that a group of sixth grade school 

^ Cf. any recent textbook on psychological statistics, such tis Garrett, 13, Ch, VII. 

^ With vcjy small samples, the criticaf ratio must be considerably luigcr than 2.58 
to permit the same level of confidence in the rcsulls. Toi further treatment of these 
technical details, cf. Garrett (13). 



Sex Differences: Basic Problems 


617 


boys and girls obtain the following average scores on an intelligence 
test: 

Girls 85 

Boys 80 

DilTercnce 5 

Let US suppose further that we have computed tlic and found it 
to be 4, 4'he critical ratio of the obtained dillerencc will then be 5/4 
or 1.25. Since this is less than 2.58, the study has not demonsiiated 
the presence of a true sex ditfcrence at the .01 level of confuiencc. 
The 5 points in favor of our group of girls may result from chance 
factors, and another investigator, giving the same intelligence test to 
other samples of boys and girls, may find a difiercnce m favor of the 
boys or perhaps no difTcrcnce at ail. Wc can, in fact, estimate ’ that 
the probability that our obtained dilfcrcncc resulted from chance fac- 
tors is 21 / 1 00. This probability is considerably higher than the cus- 
tomary 1/100 at the .01 level of confidence. 

Wc may consider one further illustration, with the following data: 


Bo>s‘ avoKige 

130 

( Tills’ avciage 

no 

J difference 

20 

O.nll 

4 


In this example, t 20 4 -- 5. Since this is much larger than 2.58, 
we knenv that the likelihood that the obtained dilTcrence has arisen 
from chance factors is much less than 1/100. The sex difiercnce is 
therefore clearly significant at the .01 level of confidence (P < .01 ). 
Wc arc safe in concluding that there is a true difference in favor of 
the boys. 1 he chances ol our being wrong in such a conclusion are 
less than 1 out of UK). 

Ihe standard error of an obivdncd difiercnce depends upon the 
size of the samplings eiiiployed as well as upon the amount of varia- 
bility within the samplings. It is apparent that the larger the sampling, 
the more reliably will the results be established. If the sampling were 
infinitely large, the standard error would be zero, since the entire 
possible population would then have been included. In most of the 
earlier investigations on sex difTeKuces, the samples employed were 
so small as to yield extremely large standard errors, had the latter 
been computed. The sex differences reported in such studies may thus 
have been due entirely to chance enors. 

By rcleience to a table of the noniial probability curve. 
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Similarly, the wide variability existing within each sex in regard to 
any given trait renders the difTercnccs between averages less reliable. 
If all women were of identical height, for example, and all men were 
likewise equal in height, then sex dilTcrenccs in height could be re- 
liably established by comparing only one representative of each sex. 
All other samplings would yield the same dilTcrence, since variation 
within each sex would be zero. The greater the variability within 
cither group, the larger will be the standard error of the obtained 
values. In the computation of SE’s, both the number of cases and the 
variability of the group are taken into account.^ 

Overlapping. When Samuel Johnson was asked which is more 
intelligent, man or woman, he replied, ‘‘Which man, which woman?” 
This is a vivid way of expressing tlic wide individual dilTcrenccs within 
each sex, with the consequent overlapping between their distributions. 
Since in any psychological trait women differ widely from each otiier, 
and men also vary widely among themselves, any relationship found 
between group averages will not necessarily hold for individual cases. 
Even when one group excels another by a large and signilicant amount, 
individuals can be found in the “inferior” group who will surpass 
certain individuals in the ‘‘superior” group. Owing to the large extent 
of individual dillcrcnces within any one group as contrasted to the 
relatively small difference between group averages, an individuars 
membership in a given group furnishes little or no inlormation about 
his status in most trails. 

In most discussions of group differences, attention has been focused 
primarily upon avciages. For a complete picture of the relative stand- 
ing of the two groups, however, some index of the degree of over- 
lapping should be included. The best procedure w'ould be to report 
the entire frequency distributions of the two groups. This is often 
impracticable, however. A simpler alternative, in the case of normally 
distributed samplings, is to state the percentage of subjects in one 
group who reach or exceed the median (or average) of the other. 
Complete overlapping would then be indicated if 50% of one group 
reached or exceeded the median of the other."'’ If more than 50% of 

* The .slandaid cnor id’ a mean is found by the formula: • The 

Standard erroi of the difference between two means is in turn based upon the standard 
errors of the two separate means. 

•'"The curves will not coincide, c:)f coluso, if the raiti'cs are iinec]ua]. In such a 
case, complete overlapping is obtained only in the sense that one distribut'on is 
contained entirely within the other. Moreover, if either distribution is pionouncedly 
asymmetrical, such a measure of overlap may be misleading. 
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group A reach or exceed the median of group B, then group A is to 
that extent superior to group B; if less than 50%, A is inferior to B. 
Occasionally, some other value is substituted for the median as the 
point of reference. Thus the investigator might report the percentage 
of group A which reaches or exceeds the highest score obtained in 
group B, or the percentage of group A which reaches or exceeds the 
upper quarter of group B. 



10- 15- 20- 25- 30- 35- 40- 45- 50- 55- 

14 19 24 29 34 39 44 49 54 59 

Scores 

Fig. 87. Dislnhiition cT boys aiid Cji'ls on a "I’cst of Arithmetic Reason- 
ing. (Uaui from vSchiller, 42, p. 67.) 

That the establishmcnl of a statistically significant difference be- 
tween two groups docs not preclude the possibility of extcrisive over- 
lapping between them is illustrated in Figure 87. This figure gives the 
distribution curves of ltS9 bo\s and 206 girls in the third and fourth 
elementary school grades on a test of arithmetic reasoning. The aver- 
age score of the boys is 40.39 and ^’'.at of the girls 35.81. The differ- 
ence between the averages is 4.58 points and the standard error of 
this difference is only 0.85. The difference is thus over five times as 
large as its standard error and can be regarded as significant with a 
high degree of confidence. An examination of the distribution curves, 
however, reveals extensive overlapping between the two groups, a 
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very large percentage of boys and girls falling within the same range 
of scores. Moreover, 38% of the girls obtained scores higher than 
the boys’ average, and 24% of the boys scored below the girls’ 
average. 

Nature of the Measuring Instrument. It is a platitude to insist 
that, in order to obtain significant data on any question, an accurate 
measuring instrument must be employed. Yet the methods of meas- 
urement employed in the study of sex differences, as well as in other 
group comparisons, have frequently been crude and often wholly 
unsuited to the problem. Thus ratings by associates were used in 
many of the earlier investigations on sex difTcrences and especially 
in those concerned with personality traits. Teachers’ ratings of school 
children were especially common. It is obvious that such ratings do 
little more than reflect the systematic bias of the judges. In the com- 
parison of such groups as the sexes or various ‘Taces” or nationali- 
ties — about v/hich popular stereotypes exist within each culture — 
ratings cannot be regarded as an index of the subject’s actual standing. 

The reliability of the tests (cf, Ch. 2) should also be taken into 
account. If a test is too short or if performance on it is affected by too 
many irrelevant factors, it will yield different results repeated ad- 
ministrations, On such a test, the scores of the same indiviilncils will 
vary widely from lime to time. These discrepancies in lest scores are 
known as errors of tucasnremcni. Group ditfcrcnces found witli a 
short and poorly constructed test may be entirely spurious and may be 
expected to disappear upon a rc-examinalion of the same subjects. 

Much confusion has also been introduced into discussions of group 
dificrences by the relatively loose designations assigned to most psy- 
cJiologicai tests. If a test is labeled “analytic reasoning,” there is a 
tendency to assume that it actually measures that trait, although such 
a trait may not even exist as a unitary function and may consist of a 
manifold of independent abilities. Similarly, if two tests arc given the 
same name, they arc commonly regarded as measuring the same func- 
tion. A hypothetical example will show how this practice may affect 
group comparisons. Let us suppose that one investigator has con- 
structed a sentence completion test, which he labels a measure of 
“logical thinking.” In such a test, as in most verbal tests, girls will 
probably excel. If now another investigator also sets out to construct 
a test of “logical thinking” and decides to employ arithmetic problems 
as his material, he will find that boys excel in this trait. The results of 
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the two studies will thus seem to be in direct contradiction, owing to 
the use of a common term to cover two discrete types of behavior. 

Many discrepancies in the data on sex dilTerenccs may be attrib- 
uted simply to such a confusion of terminology. Unless identical tests 
are administered in an identical manner, we cannot assume that the 
same functions were measured in every case. The use of a ditlerent 
time limit, for example, might change a power test into a speed test 
and thus yield entirely dilTerent results. A slight alteration in the direc- 
tions miglit make it more diificult for the subjects to understand wdiat 
is required of them and might thereby introduce a new clement into 
the test, viz., ability to follow verbal instructions, "intelligence’' scales 
arc probably the best example of the use of general terms in describ- 
ing widely diverse tests. Much controversy has been occasioned by the 
application of such scales. Owing to the employment of ""intelligence” 
scales which sample dilTerent sets of abilities, some students of sex 
dilTerenccs have concluded that boys were more intelligent, others that 
girls were more intelligent. 

A closely related problem pertains to the use of ""lump scores” in 
group comparisons. Group dilTcrences in specific abilities may be com- 
pletely obscured by the comparison of total or average scores on a 
battery of tests. If, for example, boys excel in numerical aptitude and 
girls in verbal aptitude, and a scale of so-called general intelligence is 
weighted equally with items from both fields, no significant sex dilfer- 
cncc in total score will be found. Should the scale be overweighted 
with items of one type, on the other hand, it will favor the group 
excelling in that trait, an*' will indicate an apparent dill’erence in gen- 
eral intelligence. In recent years, with the development of factor 
analysis, there has been a growing tendency to look for group dilfer- 
ences in separate abilities rather than in “general level of perform- 
ance.” In the study of group differences, it is of the greatest impor- 
tance to state results in specific terms and to limit conclusions to the 
particular materials, procedure, and other conditions of each investi- 
gation. 


Sh\ Oir-r HRLNt'LS IiV ACI«n:VElVIHNT 

The relative intellectual achievements of men and women through the 
ages have frequently been cited as evidence of a sex difference in 
ability. An examination of any biographical directory or encyclopedia 
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shows a far greater number of men than women to have achieved 
eminence. And of the few women listed in such compendiums, many 
acquired fame through special circumstances, such as royal birth, 
rather than through the possession of exceptional talent. In Ellis’ 
study (12) of British genius, only 55 women were included in the 
total group of 1030 subjects. Nor did the standard of eminence seem 
to be higher for women than for men. On the contrary, Ellis claims 
that many of the women in his group had become famous “on the 
strength of achievements which would not have allowed a man to play 
a similarly large part” (12, p. 10). CattclFs carefully prepared com- 
pilation of the 1000 most eminent persons in the woild listed only 32 
women. Of these, 1 1 were hereditary sovereigns and 8 became emi- 
nent through misfortune, beautN, or some other circumstance. This 
leaves an extremely small number who may be said to nave distin- 
guished themselves through their superior talents (8, p. 375). 

Similar results were obtained by Castle (7) in her statistical study 
of eminent women, A total of 868 names of women were collected, 
representing 42 nations and covering a wide range of epochs from the 
seventh century n,c. to the nineteenth century. The lan^csi uunihcr of 
women in the group achieved eminence through literany pursuits. 337 
or 38.8% of the subjects being classified in this field. Ihe hii'lirst 
dcf^rer of eminence, however, as indicated by the numbei of lines 
allotted to the individual in standard biographical directories, was 
obtained by v\omen as ‘sovereigns, political leaders, molliers of eminent 
men, and mi^ticsscs. Among the other non-intellec(ual factors through 
which women achie\ed lame in the past arc h‘ted manidae, re- 
ligion, birth, philanthropy, tragic fate, beauty, and “immortalized in 
literature.” 

In more recent times, the discrepancy in number of men and 
women who have distinguished themselves in intcDectual pursuits is 
still laigc, although constantly diminishing. In the E)33 edition of 
AmciUiin Men of S^unce (cL. 9, p. 1264), 775 wamien were listed 
out of a tc'tal of 97Ss entries in the pure sciences. The percentage of 
women in the various licKIs ranged Irom 2 1% in physics to ^^2% in 
psychology. In the croup ot 250 scientists who were ncwlv “starred” 
in this edition, onlv 3 women w'crc included (9). In fact, out of a 
total of 2607 scicntist^ starred between 1903 and 1943, only 50 were 
women (50). 

^’Cf footnote 2 'n Chaptcj 17. 
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The interpretation of such achievement data is obviously compli- 
cated by the many factors besides ability which determine eminence. 
The recorded differences in acliievemcnt could be fully accounted for 
in terms of the environmental conditions which have prevailed. Many 
types of occupations have been completely closed to women until re- 
cently. Thus, on the basis of their sex alone, women have been effec- 
tively barred from achieving eminence in a number of fields. When 
women have eventually been admitted odicially to such vocations, 
prejudice and discrimination against them have still been so prevalent 
that only a few could succeed. Even today, competition is not on an 
equal basis for men and women in most occupational fields. 

Educational opportunities have likewise been very dissimilar for the 
two sexes (cf. 14), although at present the environments of the 
two sexes are more nearly equated in this respect than in any other. 
Institutions of higher learning were slow to open their doors to women. 
Although America was in advance of most other countries in the edu- 
cation of women, until nearly the middle of the nineteenth century 
there was not a single institution of collegiate rank in this country 
which admitted women. Professional and post-graduate education was 
not available until a much later date. Even in the elementary and 
secondary schools, the traditional curriculum of girls was different 
from that of boys, including much less science and more literature, art, 
and other “genteel” subjects. 

Nor can general home influences be disregarded. Even in the most 
enlightened and progressive homes, differences are introduced in the 
environments of boys and girls which may prove very important in 
determining subsequent behavior development. In general, girls are 
considered weaker and more frail than boys; they arc sheltered more 
and are taught to be neater and quieter than their brothers. Boys and 
girls are given different toys to play with and different books to read. 
All these apparently minor environmental factors, operating constantly 
and from a very early age, may exert a lasting influence upon the 
development of the child’s interests, emotional characteristics, and 
intellectual talents. 

Finally, the relatively intangible i ul highly effective factor of social 
expectancy should be mentioned. This operates to perpetuate all group 
differences, once they have been established. What is expected of an 
individual is a powerful element in the determination of what he will 
do. When such expectation has the force of social tradition behind it 
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and is corroborated at every instant by family attitudes, everyday con- 
tacts in work and play, and nearly all other encounters with one’s 
fellow-beings, it is very difficult not to succumb to it. As a result, the 
individual himself usually becomes convinced that he is “superior” or 
“inferior,” or that he possesses this or that talent, interest, or attitude, 
according to the dictates of his particular culture. 

Perhaps the follow-up studies of gifted children, discussed in the 
preceding chapter, may offer a clue to adult sex differences in achieve- 
ment. In the California study (47), it. will be recalled, the adult occu- 
pations of the women were on the whole quite undistinguished. The 
number of women engaged in careers of university teaching, research 
work, art, or writing was quite small. The reported sex dilfercnces in 
adult vocational activities arc especially noteworthy when we remem- 
ber that the men and women in this group had been so selected as to 
fall within the same IQ range in childhood. Moreover, initial 10 
showed a fairly close relationship to occupational level among the 
men, but virtually no relationship among the women.'^ In fact, two- 
thirds of the women with lO's of 170 or above were houscw'ivcs or 
office workers. 

The statistics on higher education also favored tjjc men in this 
group. Although the percentages graduating from college w'erc closely 
similar for the two sexes, many more men than women took graduate 
degrees, especially at the doctoral level. The influence of cultural tra- 
ditions, social pressure, and the common conflict between marriage 
and a career can be recognized in the follow-up of this group of intel- 
lectually superior women. Such a study may help us to understand 
some of the reasons why gifted women more rarely achieve eminence 
than do gifted men. 

ShX DIFFERENCES IN VARIAIUEH Y 

During the last decade of the nineteenth century, the doctrine of 
sex differences in intellectual variability ^ rose to prominence. It was 

' Women’s scores on the Concept Mastery Test administered in the adult follow-up 
did show a significant relationship to occupational status, but this may have been 
hugely a result of educaiional differences. The fact that the women in the upper 
occupational levels had necessaiily continued their education longer may itself have 
enabled them to do bcttei on the Concept Mastery Test (47, p. 1S4). 

^ The possibility of greater male variability in physical traits v .is originally alluded 
to by Darwin, although he does not seem to have considered the problem of great 
importance (cf. 36). 
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pointed out that, although the average ability of men and women 
might be equal, the distribution of ability in one sex might cover a 
wider range than in the other. Thus it was suggested that the varia- 
bility of intelligence among males is greater than among females, there 
being more men than women at 
cither extreme of the distribution. 

These hypothetical distributions 
are illustrated in Figure 88. It will 
be noted that, theoretically, the 
averages of two groups can be 
identical while the ranges differ 
considerably. 

The doctrine of greater male 
variability was regarded as a fun- 
damental biologic 1 1 law and was 
believed to hold for all traits, 
physical as well as psychological. 

Thus Havelock Ellis, one of its chief protagonists, wrote as follows: 

From an organic standpoint, therefore, women represent the more stable 
and conservative clement in evolution (II, p. 421) ... in men, as in 
males generally, there is an organic variational tendency to diverge from 
the average, in women, as in females generally, an organic tendency, not- 
withstanding all their facility for minor oscillations, to stability and con- 
servatism, involving a diminished individualism and variability (II, 
p. 425). 

This doctrine enjoyed a long popularity and was accepted by a 
number of psychologists during the first quarter of the present century 
(cf., e.g., 8, 48). The evidence oflercd in support of the greater intel- 
lectual variability of the male was'twofold. On the one hand, the statis- 
tics on eminence were cited as proof of the greater frequency of 
superior intellect as well as of the presence of more extreme positive 
deviants in the male sex. Similar data w^ere presented to establish the 
wader range of male intelligence at the lower end of the distribution. 
Surveys of institutions for the feebleminded in several countries re- 
vealed a consistent excess of males among the inmates. Thus it was 
argued that there were more idiots as well as more geniuses among 
men, and that women as a group tended to cluster more closely around 
the average or mediocre degrees of ability. 



Fig. 88. Hypothcticid Distribution 
of Intelligence amonc Men and 
Women according to the Doctrine 
of Greater Male Variability. 
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The cultural basis of sex differences in the attainment of eminence 
has already been discussed. No biological law need be invoked to 
account for the greater frequency of men in the biographical direc- 
tories and encyclopedias. The greater incidence of males in institu- 
tions for mental defectives has likewise been shown to result from 
cultural factors. This was especially demonstrated in a study by L. S. 
Hollingworlh (16) on 1000 cases referred for examination to a psy- 
chological clinic in New York City, as well as 1142 cases in residence 
at a New York City feebleminded institution. Analysis of intelligence 
test scores and other available data revealed the diflerential operation 
of a selective factor in the case of the two sexes. 

In the first place, the males referred for examination, as well as 
those actually committed, were on the average much younger than 
the females. Secondly, the IQ’s of the females presented tor examina- 
tion were lower than those of the males. This difference in 10 was 
even greater wdien the cases actually committed were compared. A 
survey of the previous occupations and general case histories of the 
subjects suggested that the probable explanation of these findings lies 
in the uncompetitive nature of many occupations open to women. This 
makes the detection of feeblemindedness as well as the ^necessity of 
commitment less likely among women than among men. A girl of 
moron level can survive outside of an institution by turning to house- 
work, prostitution, or marri^ige as a means of livelihood. The boy, on 
the other hand, is forced into industrial work at a relatively early age 
and will soon reveal his mental deficiency in the sevcie competition 
which he encounters. Thus, although there is an excess of males in 
institutions for mental defectives, it would seem that there are more 
feebleminded females outside of institutions. 

A similar differential selection has been found to operate in admis- 
sions to special classes for mentally retarded children in the public 
school system. In a survey conducted in Baltimore (4), results showed 
that about three times as many boys as girls were enrolled in such 
special classes. The remaining girls of corresponding ability, however, 
were found in regular public school classes.^ Apparently the differ- 
ences in social and economic conditions met by the two sexes have 
led to a “double standard” in the classification of boys and girls as 
mentally retarded. 

Karl Pearson (36) was among the first to challenge the adequacy 

®Cf. also Rigg (“^9) 
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of studying sex differences in variability by a comparison of the ex- 
tremes of the distribution. He called attention to the need for direct 
measurement of variability around the average in large groups of unse- 
lectcd subjects. Pearson himself computed coefficients of relative 
variability for several classes of data, consisting chiefly of physical and 
anatomical measurements on adults. He found no evidence of greater 
male variability, but rather a slight tendency toward greater female 
variability. Similarly, Hollingworth and Montague (18) collected a 
large number of physical measurements on 1000 male and 1000 fe- 
male infants at birth, thus ruling out any possible effects of differential 
environment. No consistent sex difference in variability was found. 

A mass of data is now available on male and female variability 
in a wide variety of traits (cf. 29, 38). In such characteristics as 
height, weight, physiological maturity, dentition, and anatomical de- 
velopment, the data are inconsistent. Tlie relative variability of the 
two sexes differs with the specific trait under consideration, the age 
of the subjects, their social and economic level, and even the particu- 
lar community in which the data arc obtained. Intelligence test results 
exhibit a similar lack of consistency. On individual tests such as the 
Slanford-Binet, no sex difference in variability is generally found; 
on many grtnip tests, boys are slightly more variable. Age is also a 
factor in determining the relative variability of the sexes on intelli- 
gence tests. The same is true of variability on special aptitude tests as 
well as in school achievement, fhe findings differ with the specific sit- 
uation, in one case the boys being more variable, in another the girls. 
In the large majority of lurthermore, the dilfercnccs in varia- 

bility in favor of either sex are too slight to be of much significance. 

In recent years, it has been possible to check the theory of sex dif- 
ferences in variability on vcr> large and representative samplings. For 
example, in a Scottish survey in which all children born on any of 
four specified days were given the Stanford-Binet, data were obtained 
on 444 boys and 430 girls with a mean age of 10 years-5 months (27). 
The sex difference in variability in this group was negligible and 
insignificant, the critical ratio of the difference being less than 1. Simi- 
larly, in an American survey, 5069 boys and 5010 girls in grades 
3 to 8 of 22 city schools were given the National Intelligence Test 
(40). No significant sex difference in variability was found with any of 
the measures of variability employed. An extensive investigation of 
this question was also conducted on American high school and col- 
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lege students, utilizing data which had been collected in a survey of 

49 Pennsylvania colleges and a number of Pennsylvania high schools 
(38). Several different comparisons were made on intelligence and 
achievement tests as well as on certain physical measures. All groups 
used in these comparisons included over 1000 persons of each sex. 
In this study, too, neither sex was found to be consistently more 
variable, the results differing not only with the area of measurement, 
but also with the measuring instrument employed. 

Another approach has been to compare the relative frequency of 
boys and girls at the extremes of the distribution of intelligence test 
scores. The California study of gifted children has sometimes been 
cited in support of the theory of greater male variability, since more 
gifted boys than girls were located in the survey. The total group 
included 857 boys and 671 girls. Among the children with IQ's of 
170 or over, there were 47 boys and 34 girls. On the other hand, in 
L. S. Holliiigworth's compilation of case studies of children with lO’s 
over 180, 16 girls and 15 boys were found (17). Witty’s group of 

50 Kansas City children with lO’s of 140 or higher included 24 girls 
and 26 boys (51). 

It should be noted that the children in the Californii^ study were 
located in large part through teachers’ recommendations. Those in 
Witty’s gr(')up were found by administering a group test of intelligence 
to the entire school population in grades 3 to 7 in Kansas City, Mis- 
souri. The Holiingworth cases were identified cither through their 
conspicuous acliic\ements or through intelligence tests administered 
for other reasons. It is thus likely that the excess of boys in the Cali- 
fornia group resulted from the effect of sex stereotypes on teachers’ 
judgments. Perhaps a girl with a high IQ was more often regarded 
by her teachers simply as a “good pupil,” while a boy with the same 
IQ was judged to be “brilliant.” 

Such an explanation in terms of selective factors is supported by 
the results of complete school surveys with intelligence tests. In a 
study in which the National Intelligence Test was given to all the 
children in grades 3 to 8 in 22 city schools, the percentage of boys 
did not dilfer significantly from the percentage of girls in the combined 
upper and lower 1% of the entire distribution (40). There were, 
how^ever, more girls in the upper 7% and more boys in tlic lower 7%. 
In a more recent survey (23) whh the Kuhimann-Anderson Intelli- 
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gence Test, in which approximately 45, ()()() children in grades 4 to 8 in 
36 states were tested, the upper 10% of the group likewise included an 
excess of girls (2676 gins v\v. 1853 boys). Among the highest 2% of 
the distribution, girls again predominated in the ratio of 146.3:100. 
Among the lowest 10%, the reverse tendency was found, there being 
3009 boys and 1618 girls (28). The largely verbal content of most 
intelligence tests, as well as their dependence upon school work, prob- 
ably gives the girls an advantage and accoiinis for their superior per- 
formance. There is, however, no evidence in these surveys for a 
greater male variability, nor for a greater freijiiency of boys at the 
upper 10 levels. 

One additional point sliould be considered in connection with the 
relative frequency (T' boys and girls at high 10 levels. With increasing 
age, gifted boys are more likely to retain their high 10 or even to 
show a rise, while girls with the same initial IQ's are much more likely 
to show a drop (10, 25, 47). In the California study, for example, 
the excess of gifted boys was much greater in the high school than 
in the elementary school sampling. Moreover, in the follow-up testing 
of boys and girls within the same initial JO ranges, the girls showed a 
greater mean drop than the boys during adolescence as well as adult- 
hood. Several explanations could be suggested for such a tiiiding. Fhc 
content of intelligence tests may favor girls more at the younger ages. 
Or girls may develop more rapidly in intellectual functions and the 
boys may “catch up” as tliey grow oldei. One plausible explanation 
is that, with increasing exposure to traditional activities and social 
pressures, the intellectual’ superior boy will on the whole continue to 
improve in intellectual functions, while the equally superior girl is 
more likely to be steered into less intellectual pursuits. Sex differences 
in educational, vocational, and avov-ational activities wxnild in turn be 
reflected in an increasing divergence of the intelligence test scores of 
the sexes with age. 

Sl-X DIFFERllNCLS IN INFRAHUMAN ANIMALS 

Since cultural factors so often con plicate the interpretation of ob- 
served sex differences in human behavior, it may be of interest to 
examine sex differences in infrahuman species. It has been argued 
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that if similar sex differences in behavior are observed at various 
phyletic levels, such differences are more likely to be directly or indi- 
rectly traceable to a structural basis. In maze performance, as well as 
in other learning tasks, sex differences in animals are inconsistent and 
negligible (35, 49). Although the experimental data on many of the 
higher forms of animals arc quite meager, there seems to be no evi- 
dence for a sex difference in ability. What differences have been 
found pertain rather to emotional characteristics. 

There is a considerable body of data — from field studies, the ob- 
servations of animal breeders and trainers, and the descriptive accounts 
of laboratory workers — all of which indicate greater aggressiveness in 
the male of most species (6, 15, 41, 44, 53). Fighting, restlessness, 
and resistance to control have been commonly reported as more char- 
acteristic of male than of female animals. That this may be related 
to the presence of mule sex hormones is suggested by a number of 
experiments involving the removal of gonads, as well as the injection 
of sex hormones. It is not only reproductive behavior which is affected 
by such endocrine factors, l)ut aKo other behavior characteristic of 
one or the other sex, such as pugnacity or singing in certain species of 
birds (20, 34. 44, 46). 

On the other hand, we must guaid against overgenerali/ing from 
such results to sex stereotypes in the human. Animal data which do 
not fit the familiar human stereotypes can also be Ibund. Carefully 
controlled studies on timidity in rats, for cxam[)lc, showed females to 
be less timid than inales (2). This sex difference persisted, although 
to a reduced degree, after the removal ol gonads from rats of both 
sexes. In another series of investigations with rats, the female was 
lound to be more at five than the male (cf. 49). Also contrary to the 
traditional human stereotype were observations made on the mating 
beha\ioi of a certain species of monkey (6), in which either sex m \y 
initiate the sexual advances preparatory to copulation. There is no 
indication that the male of this species necessarily takes the initiative 
in this respect. 

All in all, the available findings on sex differences in animal be- 
havior must be interpreted with considerable caution. Such observa- 
tions may provide leads for the investigation of possible physiological 
correlates of behavioral characteristics, but it would be premature to 
make any generalizations regarding universal sex differences in any be- 
havioral function. 
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THE ROLE OF PHYSIOLOGICAL FACTORS 

General Physical Status. Some of the animal experiments have sug- 
gested the part which sex hormones may play in the general behavior 
of males and females, over and above the role of these hormones in 
reproductive behavior. The fact that endocrine secretions are carried 
to all parts of the body through the blood stream has led to consider- 
able speculation regarding the broader behavioral effects of sex hor- 
mones. It should be noted that in terms of sex-hormone production, 
there is not a sharp contrast between males and females, but the dif- 
ference is rather one of degree. All males, besides secreting the male 
sex hormone, androgen, also secrete some female sex hormone, estro- 
gen. vSimilarly, all females secrete some androgen along with estrogen. 
It is the relative proportion of the two which determines the degree 
to which the individual develops masculine or feminine chcTaetcristics. 

Another possible source of general sex ditferenccs is provided by 
the sex-determining chromosomes themselves. It wall be recalled 
(Ch. 4) that every cell in the body receives a complete set of chromo- 
somes. For the female, each body cell contains 23 pairs of chromo- 
somes plus an XX pair; for the male, each body cell contains the 
same 23 pairs plus an XY pair. In this respect, then, the tw^o sexes 
differ in every cell of the body. This docs not mean, of course, that 
every body cell must necessarily develop dilTcrently in men and 
women, since not all genes may be active in the development of every 
cell. But these sex differences in gene constitution, repeated in every 
body cell, may provide a - leclianism to account for many of the physi- 
cal differences betw'een the sexes. 

Sex differences have, in fact, bcjn reported for almost every physi- 
cal variable, including body build, minute anatomical characteristics, 
physiological functioning, and biochemical composition (46). More- 
over, the difference in most of these respects increases with age. Thus 
the human male averages approximately 5% heavier than the female 
at birth and 20% heavier by age 20; in height, the male excess in- 
creases from about 1% or 2% ir- childhood to about 10% by age 
20.^^ Muscular strength shows a consistent difference in favor of males 
at all ages (21, 46). From early infancy, males likewise exhibit 

^'DiiLng a few years in caily adolescence, girls are on the average taller and 
heavier than boys, but this lesults fiorn the developmental acceleration of girls, to 
be discussed in thf subsequent section. 
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greater “muscular reactivity,” as illustrated by a stronger tendency 
toward restlessness and vigorous overt activity. Of possible relevance 
to such an excess of muscular reactivity is the greater mean vital capac- 
ity of males. This difference is especially significant because vital 
capacity is an important factor in sustained energy output. In early 
childhood, ihe average vital capacity of boys is about 7% greater than 
that of girls; by adulthood, the male excess reaches about 35%. The 
vital index, or ratio between vital capacity and body weight, is likewise 
greater for males at all ages at which measurements have been made. 
Thus, even in proportion to his body weight, the human male consumes 
more fuel and produces more energy than the female. 

All these physical differences may play an important part in sex 
differences in play activities, interests, and achievement in various 
fields of work (46). It is reasonable to expect, for example, that the 
greater strength and ntotility of boys increase the likelihood of their 
manipulating mechanical objects, and thus indirectly facilitate the 
development of clearer mechanical concepts. Aggressiveness and dom- 
inance in social relations may likewise be initially fostered by greater 
body size, strength, and endurance. 

Rate of Maturation. It has been clearly established 4hat girls not 
only reach physical maturity earlier than boys, but that throughout 
childhood they arc farther advanced toward their adult status in phys- 
ical development (41, 45, .46). Several investigators have compared 
the height and weight of boys and girls at successive ages. In order 
directly to compaie the developmental status of the two sexes in these 
trails, each age average can be expressed as a percentage of the adult 
norm for that sex. In I able 36 will be found such percentages for boys 
and girls betw^cen the ages of 6 and 1 7. the figures being based upon 
data from several investigations. It will be noted that at each age 
measured, the girls have attained a greater percentage of their adult 
height and weight than the boys. Similar results were obtained in an 
extensive investigation by Baldwin (3), in which the same subjects 
were measured at successive ages. At certain ages the developmental 
acceleration of the girls is so great that they are actually taller and 
heavier than boys, in absolute measures. In Baldwin’s data, the girls 
were found to be superior in height between the ages of li and 13, 
and in weight between 9 and 16. 

Vital capacity is the total volume of air that can be expelled from the lungs 
after a maximal inhalation. 



Sex Differences: Basic Problems 


633 


TABLE 36 Percentage of Final Growth Which Has Been Attained at 
Ages Preceding Maturity 


(I'rom Kincolii, 21, p 20) 


Age 
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100 
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]6.5 

97.5 

99.2 

100 

100 

15.5 

94.5 

98.3 

88.7 

95.1 

14.5 

90.3 

96.3 

78.9 

87.4 

13.5 

8().4 

93.3 

70.0 

79.0 

12.5 

83.4 

89.4 

63.5 

70.0 

11.5 

80.6 

85.6 

58.4 

61.8 

10.5 

78.0 

82.5 

54.1 

56.0 

9.5 

75.1 

79.3 

49.0 

51.0 

8.5 

73 3 

76.1 

45 0 

46.7 

1,5 

69 1 

72.8 

40 9 

42.4 

6.5 

65.9 

69.0 

37.4 

38.5 


Other aspects of physical development show a similar acceleration 
of the female sex. It is a well-known fact that girls reach puberty 
earlier than boys, the dillerence averaging from 12 to 20 months in 
various groups. Skelelcil developineiit can be measured by the relative 
degree of ossification, or hardening, of the bones in different parts of 
the body. In this also, girls have been found to be in advance of boys 
at every age (cf. 41. 46). A similar dillerence has been found in 
dentition. In general, girls shed their deciduous teeth sooner and get 
their permanent teeth at an eailicr age than boys. In the case of cer- 
tain teeth, these dilTercnccs amount to one year or over (41. 46). 
I he general developmental .acceLration of girls begins hejore birth. 
Girls arc on the average mure mature than boys at birth and there is 
some evidence which indicates that they tend to be bor n after a 
shorter gestation period than boys (41 ). 

The significance of sex dilTercnces in the rate of physical growth 
has been emphasized by .several writers (cf., e.g., 5, 24, 36, 41). It 
has been suggested that girls may v.. icceleratcd in intellectual as well 
as physical dWclopmcnt. If this were the case, equaled age groups of 
boys and girls would not be comparable. It would then be necessary' to 
equate the sexes in regard to developmental stage or physical maturity 
rather than chronological age. But such a procedure would introduce 
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an inequality in amount of training and general environmental stimu- 
lation. This problem, of course, arises only in the comparison of chil- 
dren, and does not apply to adults. Children, however, have been the 
most frequent subjects for surveys on sex differences, both be- 
cause of their greater accessibility in large numbers and because 
they have been exposed to a relatively more homogeneous environ- 
ment. 

It should be noted that intellectual acceleration of girls has not been 
directly demonstrated. Its possibility has only been inferred by analogy 
with physical development. It is doubtful, however, whether physical 
maturity can have much influence upon intellectual development. The 
data on the relationship between psychological and physical traits are 
too consistently negative for such an assumption (cf. Ch. 12), In emo- 
tional and other personality traits it is probable that the onset of 
puberty and the relative physiological maturity of the individual intro- 
duce an uncontrolled factor in sex comparisons at certain ages. But 
in regard to the individual’s intellectual status, the environmental 
stimulation to which he has been exposed is far more significant than 
slight differences in physical condition. 

Another possible implication of the developmental acceleration of 
girls is a social one (41). Because of their physical acceleration, ado- 
lescent girls tend tc associate with boys older than themselves. This 
probably accounts also fo^ the usual age discrepancy in marriage. 
Since the girl is generally younger than the boys with whom she asso- 
ciates — and younger than the man she marries — she is surpassed by 
most of her male associates in education, intellectual development, and 
general experience. Such a situation may well be at the root of many 
social attitudes toward the two sexes. A younger individual is likely to 
have less wisdom, information, and sense of responsibility than an 
older one, and such an age difference may have been interpreted and 
fostered as a sex difference. 

Viability and Physical Defects. At all ages, the female shows more 
“viability,” or capacity to maintain life, than docs the male. The inter- 
pretation of mortality statistics in adulthood and even in later child- 
hood is, of course, complicated by differential hazards met by the two 
sexes in their traditional occupational and recreational activities. That 
the higher mortality rate of males cannot be explained wholly on this 
basis is, however, indicated by several facts. 

First, prenatal and infant deaths are more common among boys 
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(41, 46). It has been estimated that the ratio of male to female con- 
ceptions lies between 120:100 and 150:100. Although 20% to 50% 
more boys are conceived,’® however, only 5% or 6% more boys than 
girls are born. Thus even before birth, death has already taken a 
much greater toll from the male sex. At every stage of prenatal devel- 
opment, the percentage of male deaths is greater than that of females. 
Moreover, this difference in viability is not limited to the human, but 
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Fig, 89. Dramatized Schema Illust'a+irig Why Males Have More Hcredi- 
Uiry Defects. iFrom Scheinleld, 41, p. UO,) 


The reason for the c\cc«^s of male conceptions is not clear. Tt has been sug- 
gested that the male-prodiKing. or Y-beaniig, spermatozoon is lighter and more 
motile than the X-hcaring speirnalo/oon. Another possibility is that the male-pro- 
ducing spermatozoon has a better chance of survival in llv uterine environment for 
either chemical oi physical reasons. 
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is true for lower animals as well. Throughout life, the male appears 
to be biologically more vulnerable in many ways. He is more sus- 
ceptible to infection and is more often afflicted with physical defects. 
All but a very small number of defects are more common among 
males. 

One reason for this sex difference in viability and in physical dis- 
orders may be found in the sex chromosomes. Since the female re- 
ceives two X-chromosomes, the effect of a defective gene in one of 
these chromosomes may be counterbalanced by a normal gene in the 
other. This relationship is illustrated in the dramatized diagrams 
shown in Figure (S9. The male, on ihc other hand, receives only one 
X-chromosome. The Y-chromosomc contains relatively few genes, 
and it is doubtful whether any (>f them are counterparts of X-gcncs. 
It is thus much more likely that a defective gene in the male will find 
no normal counterpart to check its effect (Fig. 89). This relationship 
between corresponding genes in each pair of chromosomes can per- 
haps be best understood when we realize that a defective gene is 
probably one lacking in certain essential chemical substances. Such a 
deficit can be overcome by the presence of the same substance in the 
corresponding normal gene. 

We could speculate at length regarding the possible social implica- 
tions and indirect psychological effects of the greater viability of the 
female. Thu example, one result is the increasing excess of women at 
the upper age levels — a condition which influences the rekitive oppor- 
tunity for marriage. A proportional scarcity of males makes marriage 
a more competitive uiiderlaking for the female than for the male. This 
situation could in turn be rellccted in divergent personality develop- 
ment in the two sexes. 

Homeostasis. An interesting physiological concept which has re- 
ceived increasing attention in discussions of sex differences is that of 
homeostasis, or the stal^dity of bodily functions. ’T here is considerable 
evidence suggesting that homeostatic mechanisms, which tend to keep 
the body in its normal condition, operate within narrower limits in 
the male (46). Thus men show less tluciuation in such measures as 
body temperature, basal metabolism, acid-base balance of the blood, 
and level of blood sugar. The observation that females are more sub- 
ject to flushing and fainting and to various glandular imbalances has 
likewise been cited cis evidence of their greater physiological insta- 
bilitv. 
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From these differences in physiological homeostasis, some writers 
have proposed a parallel sex difference in “mental homeostasis” (20). 
Vo this they attribute the greater “psychic unrest” of the female, as 
evidenced by more frequent emotionality, neurotic tendencies, nerv- 
ous habits, feelings of inadequacy, and other symptoms of instability. 
The analogy is interesting, but we must proceed with the utmost cau- 
tion in making such a transition from physiological to behavior data. 
Even in physical functions, exceptions can be found to the greater 
stability of the male sex. Moreover, we cannot assume that psycho- 
logical and physiological homeostasis are necessarily related to a very 
high degree. It is true, for example, that physiological changes occur 
during emotional excitement, but it docs not follow that individual 
differences in emotionality are correlated with individual differences in 
physiological characteristics. Furthermore, the physiological changes 
themselves may be influenced by the individual's previous experiences, 
home background, and the like. In fact, the evidence on individual 
differences in personality development tends to emphasize the role of 
experiential factors. Such factors may be equally important in deter- 
mining sex differences in behavior. 

THE ROLE OF CULTURAI. F ACTORS 

That sex roles and sex stereotypes vary in different times and places 
is apparent not only from anthropology but from our own cultural 
history as well. To be sure, a few persistent differences in behavior 
can be identified. These undoubtedly result Irom some of the physical 
differences considered in the preceding section. Thus the widespread 
prevalence of male dominance in iifferent cultures may be historically 
related to sex differences in physique and muscular strength. But the 
amount of such sex differences in dominance varies widely from cul- 
ture to culture, as does the manner in which it is expressed. Moreover, 
many characteristics associated with the traditional male sterecMype 
in our culture may be absent or reversed jn other cultures (cf, 22). 

Occupations have traditionally p rovided one of the principal cul- 
tural areas of sex differentiation, in relatively primitive cultures, in 
which occupations arc predominantly physical, a sharp division of 
labor between the sexes is necessarily observed. Figure 90 presents a 
summary of the sex distribution of ten principal activities, based upon 
data from 224 representative tribes throughout the world. Because the 
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Fig- 90. Division of Labor Between the vSexes in 224 Representative 
Tribes Tbroughoiit the World, Size of figures shows the approximate 
degree to which each sex participates exclusively, predominantly, or to- 
gether with the other sex. Heavy lines beneath figures indicate approximate 
total percent of male or female participation in each occupation, black 
space indicating male and white space female participation. (From Schein- 
feld, 41, p. 293. Original data from Murdock, G. “Comparative Data 
on the Division of Labor by Sex,” Soc. Forces, 1937, 15, 551-553.) 
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female bears and suckles the young, she is likely, in such primitive 
cultures, to engage in occupations which keep her closer to home, 
such as the preparation of food and the manufacturing and repair of 
clothing. Superior muscular strength and endurance cause the men 
to take over warring, metal work, hunting, and most of the fishing. 
But modern occupations do not fit into these primitive categories (41, 
44). Even modern warfare is not so much a matter of handling spears 
and javelins as it is a matter of pushing buttons and designing blue- 
prints. Paradoxically, it is the home that is now one of the principal 
loci of physical occupations, in contrast to the oflice, the store, the 
conference room, or the auditorium. With the development of ma- 
chinery, the physical demands of more and more occupations are 
becoming reduced. Our thinking should not, therefore, be hampc»'ed 
by traditional stereotypes, but rather should be guided by the demands 
of the specific situation and the abilities of the specilic individual. Sev- 
eral writers have called attention to the need for revising out concep- 
tion of sex roles in terms of developments in modern living (32, 43). 

That women have no ‘'natural affinity” for certain tasks, nor men 
a “natural repugnance'" toward their performance, can be amply illus- 
trated. Huxley and Haddon ( 19, p. 69). in discussing the influence of 
social pressure upon sex differences in aptitudes, cite the remark of 
the third century Greek gossip writer, Athenaeus, ‘^Whoever heard of 
a woman cook?" In the same vein. Mead (31, p. xix) calls attention 
to “the convjntion of one Philippine tribe that no man can keep a 
secret, the Manus assumption that only men enjoy playing with babies, 
the Toda prescription oj almost all domestic work as too sacred for 
women, or the Arapesh insistence that women's heads are stronger 
than men's.” Other illustrations can be found in the history of our 
own culture. Most writers on the social history of the Middle Ages, 
for example, call attention to the “masculine character" of women of 
that period. Thus Garreau, writing about France at the time of the 
crusades, has this to say: 

A trait peculiar to this epoch is the close resemblance between the 
manners of men and women. The ri.'v (hat such and such feelings or acts 
are permitted to one sex and forbidden to the other was not fairly settled. 
Men had the right to dissolve in tears, and women that of talking without 
prudery. ... If we look at their intellectual level, the women appear 
distinctly superior. They arc more serious; more subtle. With them we do 
not seem dealing with the rude stale of civilization that their husbands 
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belong to. ... As a rule, women seem to have the habit of weighing 
their acts; of not yielding to momentary impressions.^^ 

The play activities of boys and girls have been a subject of frequent 
discussion. Some would argue, for instance, that girls play with dolls 
because of a nascent “maternal drive” or some similar innate interest 
or emotional trait characteristic of their sex. The almost complete ab- 
sence of this type of play activity among boys has accordingly been 
regarded as indicative of a fundamental biological diversification in 
emotional response. An observation made by Mead (30) in her 
studies on the island of Manus in New Guinea is of interest in this con- 
nection. Dolls are ordinarily unknown to the children on this island. 
But when they were presented for the lirst time with some wooden 
statuettes, it was the boys and not the girls who accepted them as 
dolls, crooning lullabies to them and displaying typical parental be- 
havior. I his reaction can be understood in terms of the pattern of 
adult behavior in Manus. Owing to the traditional division of labor, 
the women arc busy with their various duties throughout the day, 
while the men have much more leisure time between their activities 
of hunting and fishing. As a result, the father rather than the mother 
attends to the children and plays with them. This socially established 
differentiation of behavior was renected in the play responses of the 
boys and girls. 

Another vivid illustration of the role of cultural factors in sex dif- 
ferences in behavior is furnished by a subsequent series of observa- 
tions reported by Mead (31 ). 1 hese concerned the traditional emo- 
tional characteristics of men and women in three primitive societies 
in New Guinea. The three groups were sharply contrasted in the pat- 
tern of male and female personality which they presented. Among the 
Arapesh, both men and women displayed emotional characteristics 
which in our society would be labeled distinctly feminine. In this 
group both sexes are trained to be cooperative, un aggressive, gentle, 
non-competitive, and responsive to the needs of others. They arc 
strongly imbued with a sense of obligation toward any who are weaker 
or younger than themselves. Fven their typical response toward mate- 
rial objects is not one of possession but of solicitude. 

The Mundugumur, a river-dwelling tribe of cannibals and head- 
hunters, present a sharply contrasting picture. In this society both men 

Cidrreau, L. L'eiat .social dc la fiance an temps dcs Croisadcs. Paris, 1899. 
Quoted in 1, p. 199. 
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and women arc violent, aggressive, ruthless, and competitive. They 
take great delight in action and in fighting. They are quick to perceive 
an insult and ever ready to avenge it. Because of an intricate system 
of family organization, the child is born into a hostile world, in which 
most members of his own sex are his enemies. This is particularly 
true of boys, but a child of cither sex will be disliked and resented by 
some members of the family. 

Perhaps the most interesting pattern is presented by the Tchambuli, 
among wliom there is a genuine reversal of the sex-attitude of our 
culture. It is the women who have the position of power in Tcham- 
buli. The group depends for its food supply upon the fishing of the 
women, the men rarely engaging in this activity. Fish is also the staple 
product of trade, in exchange for which several essential comme^di- 
ties are obtained. Similarly, it is the women who make mosquito bags, 
the most important arlicic of Ichambuli manufacture and m great 
demand by outside purchasers. The men, on the other hand, engage 
predominantly in artistic and non-utilitarian pursuits. Most men are 
highly skilled in more than one art, including dancing, carving, paint- 
ing. and others. It is the man in this society who is concerned with the 
beauty and elaboration of his costumes and the excellence of his 
artistic accomplishments. This type of life is rellectcd in pronounced 
personality dilferences between the sexes. 1'he women are impersonal, 
practical, and enicient. Their attitude toward the men is one of kindly 
tolerance and appreciation. The men are graceful, artistic, emotionally 
subservient, timid, sensitixe to the opinions of others, and throughout 
their lives dependent upoii die security afforded to them by the w'onien. 

As in our society, each of these three cultures has its ‘Tleviants,” its 
maladjusted individuals whose pv sonality trails clash with the ac- 
cepted standards. But the deviant m one society often coincides with 
the traditional ideal ot another. Thus the “masculine'' w^oman among 
the Tchambuli is one who embodies the typically feminine character- 
istics of our society; the “elfcminale" Tchambuli man displays be- 
havior which we would characterize as typically masculine. In a final 
evaluation of her findings. Mead w-^' cs; 

We arc forced to conclude that human nature is almost unbelievably 
malleable, responding accurately and contrastingly to contrasting cultural 
condition'^. The differences bctw'cen individuals who are members of dif- 
ferent cultures, like the differences between individuals within a culture, 
are almost entirely to be laid to differences in conditioning, especially 
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during early childhood, and the form of this conditioning is culturally 
determined Standardized personality differences between the sexes are of 
this order, cultural creations to winch each generation, male and female, 
IS trained to conloim (31, pp 280-281). 

It IS apparent that cultural factors play an important part in the 
dilTcrcntiation of sc\ roles and m the conesponding sex diffcienccs in 
behavior Moreover, even when ph^^ical dillcien^cs contribute to sex 
difleiences in beha\ioi, the contiibution is usually indirect and intri- 
cately o\ Cl laid with cultural factors In such cases, it is the social 
unphcations of such physical differences, rather than tin biological 
s^x differences themselves, which kad to divergent peisonahty devel- 
opment m the two sexes. 
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Sex Differences: 
Major Results 


The data which will be surveyed in the present chapter concern 
sex differences under existing conditions in our society. Such data, 
although limited in their application, are not without value. Thus it is 
of considerable practical interest to ascertain the typical behavioral 
characteristics of men and women, whatever may be the origin of the 
differences. The number of situations in which such knowledge may 
prove useful is legion. In many fields of activity, definite assumptions 
arc made in regard to existing sex differences in aptitudes, interests, 
emotional responses, and similar traits. This sex d^erentiation is 
noticeable in advertising and selling, job placement, political cam- 
paigning, the organization of newspapers and magazines, social work, 
crime prevention, and Ihg treatment of offenders, to name only a few 
outstanding examples. In a desciiptivc account of any one cultural 
group, the question of sex differences in behavior can be legitimately 
raised. Regardless of whether such differences are the indirect result 
of structural dissimilarities or whether they have an exclusively cul- 
tural or environmental origin, they cannot be ignored in the practical 
adjustments of everyday life. 

It is also possible that a careful analysis of the material on sex 
differences, in conjunction witli other available information, may help 
to clarify the nature and source of such differences. Such an approach 
can never furnish a conclusive account of the origin of sex differences, 
but it may indirectly yield some corroborative evidence on this 
problem. 

In view of the problems discussed in the preceding chapter, such as 
selective factors, extensive overlapping of groups, errors of sampling, 
errors of measurement arising from inadequacy of the tests, and un- 
warranted generalizations regarding the functions measured, it is obvi- 
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ously difficult to formulate any summary statements regarding sex 
differences from the data of a number of independent investigations. 
This is especially true since such investigations differ widely in num- 
ber and kind of subjects, specific tests or materials employed, and 
other important conditions. Similarly, all but the most recent and best 
controlled studies fail to report reliabilities of differences, degree of 
overlapping, and other essential facts, thus making it difficult to eval- 
uate their findings. In the face of these conditions, the only available 
criterion for the acceptance of a conclusion is the consistency of re- 
sults of different investigators, A survey of the experimental literature 
on sex differences reveals certain major findings which are so fre- 
quently reported by different investigators as to suggest a valid basis 
in fact. It is with these findings that we shall be primarily concerned.' 

SIMPLE SENSORl-MOTOR FUNCTIONS 

In sensory acuity, sex differences are slight and inconsistent, with the 
exception of the female superiority in color discrimination. Color- 
blindness is found in about eight times as many men as women; the 
most common, oi; red-green, form of color-blindness occurs in about 
4% of the general male population and only about 0.5% of the 
female. There is fairly conclusive evidence, moreover, that the most 
common form of color-blindness is a sex-linked hereditary deficiency 
fcf. Ch. 4). Even among individuals of normal vision, females excel 
in color discrimination. For adults, such a difference could be attrib- 
uted to the greater amouij of practice which women have had in the 
use of color, as in matters of dress, embroidery and other needlecrafts, 
interior decoration, and the like. T:.e female superiority in color dis- 
crimination has, however, been observed even in early infancy (80). 
These results raise an interesting point which may also be related to 
the explanation of certain other sex differences in behavior. Owing to 
sex differences in rate of maturation (Ch. 18), it is possible that 
female infants excel male infants of the same tffironological age in 
their color responses simply becaus • the females are farther along in 
their development. The fact that girls have a “head start” in this func- 

^No attempt will be made fo piescnt a suivcy of specific studies on sex diffet- 
ences in behavior. Such material has been petiodically reviewed by vaiious writers. 
Cf. Allen C), Goodenough (28), L. S. Hollu gworth (39, 40), Lincoln (51), Loiiltit 
(54), C, C. Miles (60), Wellm.m (94), Woolley (87, 100, lOl), and the more recent 
reviews by Johnson and Termnn (42) and Terman et ah (86). 
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tion may in turn tend to perpetuate their advantage, through greater 
interest and experience in handling colors. In other sensory modalities, 
such as taste, smell, hearing, and touch, the data cither show no sig- 
nificant sex dillerencc or are difficult to interpret because of the 
presence of uncontrolled factors. 

In tasks involving the rapid perception oj details and frequent shifts 
of attention, women generally excel. This is one of the principal abili- 
ties measured by clerical aptitude tests, on which women make a con- 
sistently better showing than men. In the norms reported for the 
Minnesota Clerical Test, only about 16% of male workers in the gen- 
eral population reached or exceeded the median of female workers 
in checking similarities or dilTerences in lists of names and numbers 
(65) . Moreover, a scries of difiercnL investigations showed a significant 
female superiority on this test from the fifth grade through the senior 
year of high school (65, 74). 

Fairly large and consistent sex dilTerences have been reported in 
various aspects of motor perjormance. On the average, boys surpass 
girls not only in muscular strength, but also in speed and coordination 
of gross bodily movements. This dilTerence has been observed from 
infancy. In extensive observations of children of prc^chool^age, Gcsell 
and his co-workers (25) found that boys were faster and made fewer 
errors in walking a series of narrow boards. The boys also achieved 
more accuracy and greater distance in throwing a ball ilian did girls 
of the same age. In connection with the latter observation, a study 
was also made of the characteristic ball-throwing pattern of boys and 
girls. A clear-cut sex difrerenliation in the typical ball-throwing stance 
was already apparent among 5- and 6-year-olds. Males of all ages 
average better than females on such coordination tests as aiming and 
tracing. Men have also been found to have shorter and more consistent 
reaction times than women. 

In manual dexterity, on the other hand, girls generally excel (25). 
In early childhood this is exemplified by the fact that girls are usually 
able to dress themselves at an earlier age and more efficiently than 
boys. Girls’ superior control of finger and wrist movements is also 
indicated in such behavior as hand washing and turning door knobs. 
In the standardization of the 1937 Stanford-Binct, more girls than 
boys passed the tests on buttoning and on tying a bowknot. The 
statistical significance of these sex differences was exceptionally high 
(59). That adult women can perform many manipulatory tasks more 
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quickly and accurately than men has been widely recognized in indus- 
try, This fact was especially apparent during World War II, when 
women were frequently assigned to assembly, inspection, and similar 
industrial operations. Such an observation is also supported by apti- 
tude test performance. On tests like the O’Connor Finger Dexterity 
Test, O’Connor Tweezer Dexterity Test (cf. 8), and Purdue Pegboard 
(90), the norms for adult women are consistently higher than those 
for men. 

The male superiority in gross bodily movements may be largely the 
result of such structural factors as muscular strength and bodily size 
and proportions. The female advantage in manual dexterity and speed 
and control of fine movements, on the other hand, may arise initially 
from the developmental acceleration of girls. In general, delicate 
movement follows gross hcxlily movement within the development of 
the individual. Giils would thus be expected to develop tine motor 
coordinations at an earlier age than boys. These initial, structurally 
determined sex ditTercnces may affect the acquisition of interests and 
skills, thereby setting in motion a progressive mechanism of dilfcren- 
tiation between the sexes. 

It has been suggested, for example, that the differences in motor 
development may help to explain why girls play with dolls much more 
commonly than boys.- The detailed hand movements involved in 
dressing and undressing dolls and in related play activities may appeal 
more to girls because of their superior manual dexterity. At the same 
lime, it would be legitimate to ask, “Why dolls'r On the basis of 
manual dexterity alone, many other types of toys would qualify, in- 
cluding erector sets, mechanical puzzles, and toy clocks which could 
be taken apart and rc-assemblcd. ilut girls in our society are not gen- 
erally given such toys. Moreover, they usually receive their first doll 
when they are too young to do anything but throw it across the room, 
in typically “masculine” fashion. This is but one illustration of the fact 
that cultural influences are so inextricably involved in behavior, from 
its earliest beginnings, that it is well-nigh impos'^iblc to trace the effect 
of structural factors per sc. 

INTELLECTUAL EUNCIIONS 

General Intelligence. On most intelligence tests of the common 
verbal type, sex differences arc slight, but more often in favor of girls 

^ Dr. Helen Thompson, quoted jn Schemfeld (73), p. 92. 
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than boys (44, 86). On such widely used group tests as the National 
Intelligence Test, for example, girls excel consistently (44, 70). Selec- 
tive factors in sampling sometimes produce misleading results. Thus in 
high school groups, boys generally obtain higher averages than girls, 
since the duller boys tend to drop out of school in larger numbers than 
the duller girls (Ch. 18). Similarly, the exclusion of institutionalized 
cases or of children in special classes tends to favor the boys, since a 
larger proportion of girls of correspondingly low intelligence remain 
in regular classes (Ch. 18). When samplings are fairly comparable, 
however, most intelligence tests show either no significant sex dif- 
ference or a slight dilfercnce in favor of girls. 

1 he female advantage on many intelligence tests has been found 
from early childhood to late maturity. Jn one study (2^:1) on preschool 
children, the Kuhlmann-Binct was given to 50 boys and 50 girls at 
each of the ages 2, 3, and 4. The average TO of the girls was higher 
than that of the boys, the diJTercnce persisting when the children were 
retested after a lapse of six weeks. Thus the obtained diUcrence could 
not be attributed to chance lluctuations or to a sex dillerence in re- 
sponse to novel situations. A similar difference w'as louud in the study 
of mental growth and decline conducted in rural New Tfigland com- 
munities and described in Chapter 9 (16). Within the entire sample 
of 581 men and 607 women between the ages of l(t and 60, the women 
obtained a significantly higher average on the Army Ai}>ha. In some of 
the separate age groups, the sex difference was insignificant, although 
still favoring the women. 

Surveys with the Stanford-Binet have tended to show negligible sex 
differences. In the Scottish survey discussed earlier (Chs. 3 and 18), 
all children born in Scotland on each of four specified days w^ere given 
the 1916 Stanford-ITinct. The 444 boys thus tested had an average 
TO of 100.5 and the 430 girls 99.7. The critical ratio of this difference 
is only .86, indicating that the difference is no greater than one would 
expect from any two samples of the same sex. A fundamental point 
to consider in this connection is tliat in a carefully standardized indi- 
vidual test such as the Stanford-Binet, items which favor one sex or 
the other arc deliberately omitted. This is particularly true of the 1937 
revision of the Stanford-Binet (cf. 59). Itcijis which showed a large 
sex difference in the percentage passing were excluded entirely, on the 
assumption that sex differences cn such items may be specific to the 
task in question and may simply reflect differences in experience and 
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training. Among the remaining items, those slightly favoring girls were 
balanced against others which favored boys to an equal degree. The 
fact that no significant sex difference in 10 was found in the standard- 
ization sample of the 1937 Stanford-Binet is therefore an index of the 
care with which this procedure was followed, and has little or no bear- 
ing upon sex differences in inlelligencc. 

Whether boys or girls obtain higher IQ’s depends upon the items 
which are included in the test. When no deliberate effort has been 
made to exclude sex differences from the test, there has generally 
been a tendency to favor girls. I'his follows from the fact that intelli- 
gence tests consist so largely of verbal items, on which girls are supe- 
rior.'* In so far as the tests depend upon memory, girls have an addi- 
tional advantage.* Moreover, many intelligence tests arc validated 
against school achievement, in which girls also excel, especially at the 
elementary school level, it is apparent from this discussion that the 
question of which is the more “intelligent’' sex is somewhat ambig- 
uous. In the light of what we now know about trait organization and 
the nature of intelligence (Ch. 15), this is not surprising. It is much 
more meaningful to ask wluit sex differences exist in the more specific 
functions which make up 'Intelligence'’ in our culture. 

Special Aptitudes, r’cmale superiority in verbal or linguistic func- 
tions has been noted from infancy to adulthood (58, 86). This differ- 
ence is found in almost every aspect of language development which 
has been studied, and has been reported with remarkable consistency 
by different investigators. In fact, the few results which faff to support 
this difference — or, more rarely, reverse it — can usually be explained 
either by selective factors or by the use of material which appeals much 
more to the interest of boys (cf. 5 ^ ). Observations on normal as well 
as on gifted and feebleminded children have shown that on the average 
girls begin to talk earlier than boys. vSimilarly, girls of preschool age 
have a larger vocabulary than boys. In one study (57 ), the percentage 
of comprehensible vcibal responses was determined for each child. 
At 18 months, the average per cent was 14 for boys and 38 for girls; 
at 24 months it w:is 49 for boys and 98 for girls. Girls likewise begin 
to use sentences earlier than boys and tend to use more words in sen- 
tences. In learning to read, girls make more rapid progress than 
boys (72, 98). 

■^ 'Ihe evidence for these sex differences in specific inteUectual functions will be 
discussed in the follow'ing section. 
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Girls also reach maturity in articulation at an earlier age than boys. 
The articulatory patterns of girls in the first school grade are approxi- 
mately the same as those of boys in the second grade. This develop- 
mental difference in the motor aspects of speech may provide a clue 
to the general female superiority in linguistic functions. The accelera- 
tion of girls in physical development probably accounts for their more 
rapid progress in articulation. 7 his in turn may give them a powerful 
initial advantage in the mastery of all phases of language. Such a dif- 
ference in developmental rate may also aceount in part for the much 
greater frequency of reading disabilities, stuttering, stammering, and 
other speech disorders among boys. The ratio of male to female stut- 
terers varies from 2;1 to 10:1 (75, 76). In a survey of 17 groups of 
reading disability cases (5), the proportion of boys varied from 60% 
to 100%. If, in speaking and reading, boys are more often held up to 
standards which they arc not structurally ready to meet, they may 
experience more frustralion, loss of confidence, and confusion in lin- 
guistic situations than girls (58, 75, 76). This may be an important 
factor not only in the development of linguistic disorders but also 
in the normal individuafs subsequent progress in verbal functions. 

The verbal superunily of girls persists throughout the successive 
educational levels, the sex difference often becoming more pronounced 
at the upper levels. Girls usually excel in speed of reading and in such 
tests as opposites, analogies, sentence completion, and story comple- 
tion. In a study of the language development of children in grades 
4 to 12, 472 boys and 514 girls were asked to write a composition 
on a prescribed topic of interest to' both sexes. Within the same time 
limit, girls produced longer themes than boys, the elementary school 
boys using on the average 86% as many words as the girls, and the 
high school boys 83% (45). 

A mass of relc>^ant data is also provided by the analysis of sub-test 
performance on intelligence tests. In the standardization sampling of 
the 1937 Stanford-Bmel (59 ), a significantly greater percentage of girls 
passed some of the sentence completion and code learning tests.*^ In 
the two investigations with the Prcsscy Group Test of Intelligence re- 
ported in the preceding chapter, an analysis of sub-test scores show^ed 
fairly consistent sex differences in the elementary school and high 
school samplings (9, 69). In both samples, the girls c celled in word 

^ The specific tests arc Minkiis Completion (Form L, Year Fevcl XII, lest 6) 
and Codes 1 (Form M, Average Adult Level, Test 4). 
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completion and dissected sentences, and in the elementary school 
group they also surpassed the boys in opposites and analogies. It will 
be recalled that among the high school seniors, owing to»the dilleren- 
tial elimination of male and female students, boys excelled in total 
score on this test, whereas girls had excelled at the elementary school 
level. Performance of boys and girls on the separate tests, however, 
showed the same relative standing in both edueational groups. 1 he 
reversal in total score from the elementary to the high school groups 
resulted from the fact that the high school senior boys excelled by a 
much larger amount in the same tests in which the elementary school 
boys excelled. wSimilarly, the high school senior girls excelled by a 
smaller amount in the tests in which the elementary school girls had 
excelled markedly. 

Large and significant sex ditTerences in verbal functions have also 
been found on the psychological tests administered to entering college 
freshmen. Table 37 shows the average scores of several thousand cn- 


TABLE 37 Sex Differences auunu: Collei^e Fresfuuen in the Verbal 
and Mcnhematieal Sections of the SrhoUistic Aptitude Test 

( fioin Id, ]> >8.0 



Ninnher of Cases 

A 1 ^ 

; Sc ore 

\oibal 

Mathematical 

Boys 

4214 

486.58 

511.15 

Girls 

.1362 

512.29 

476.74 

dill . , Oil 1 1 / . 


11.34 

15.27 


tering students of each sex on the verbal and numerical parts of the 
Scholastic Aptitude Test administered in 1931 by the C'ollegc Entrance 
Examination Board. The ditlerence in favor of the girls on the verbal 
section is over eleven times as large as its standard error, considerably 
larger than the critical ratio of 2.58 rcviuircd at the .01 level of con- 
fidence.'' On the American Council Psychological Examination 

In the sjmplings tested during the later 19.'t0’s and the decade of the 1940’s, this 
sex dinerencc tended to disappcai, the boys’ and giils’ averages in the vcibal scoie 
being piactically ulentical. Changes in college admission policies as they adcclcd tJie 
two sexes, as well as other selective factors, may account in part for this shift in 
relative standing. Tn all years, however, the girls did much better on the vcibal than 
on the numcrKul parts of the test, while the rcvcise was true of the boys. Cl. College 
Entrance Examine uori board. Annual Reports of the Dircciof, 1935 1948. 
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(ACE), sex differences in total L-score, based upon the three verbal 
sub-tests, arc negligible and inconsistent from year to year. Some of 
these linguistic sub-tests may, however, depend to a considerable 
extent upon general information, in which boys consistently excel. 
Studies on Negro college students (13). as well as on white, Chinese, 
Japanese, and part-Hawaiian high school graduates tested in Hawaii 
(52), showed a highly significant female superiority in the artiticial 
language sub-test of the ACE. This differenc e appeared con- 
sistently and reliably in all sub-groups. The artificial language test 
attempts to measure the ability to handle linguistic relations, inde- 
pendently of the individuafs general information. Given a short 
vocabulary and a few simple grammatical rules, the sul)jcct is re- 
quired to “translate” a short English passage into this artificial 
language. 

Girls also excel in most tests of memory^ although the differences 
are neither so large nor so consistent in this respect as they are on 
verbal tests. In the standardization sample of the 1937 Stanford-Binet 
(59), a significantly greater percentage of girls passed the tests of pic- 
ture memories and copying a bead chain from memory. No significant 
difference in fa\or of either sex was found on other ^icmory tests 
in the scale." Group tests of intelligence also tend to show superior 
female performance on sub-tests iinohing memory (9, 69, 86). In 
digit span and in memory for geometric forms, liovvexer, sex dif- 
ferences arc negligible and inconsistent (<86). In memory for iiarra- 
ti\es, the direction of sex differences often depends upon the lelalivc 
appeal of the content for the two sexes. 

In general, however, w'hcn the content fa\ors neither sex, girls 
tend to excel more consistently in logical than in rote memory. This 
may result from the greater dependence of l(\iiical memory tests upon 
verbal comprehension. It is possible, in fact, that the female superi- 
ority in many memory tests is attributable to the role of verbal 
functions in iacilitating retention and recall of most types of material. 
Another relevant observation is that women seem to have more \ivid 
menial imagciy than men in every sense modality. This finding, first 
suggested by Cialton on the basis of his famous “breaklasl table” 
questionnaire (24), has been subsequently corroborated by several 

® This test was not ict.nned in ihc more tecent foinis of th^‘ ACE. 

With the exception of memory loi a stc'iy about aciohats, which obviously 
appealed muie to the interests ol bo>s than to those of puls. 
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investigators. To what extent such a dilTcrcnce may be the result of 
sex differences in occupations and other traditional activities remains 
to be seen. 

A difference in favor of the male sex has been repeatedly observed 
in various phases of spatial ami juechankal aptitude. The possibility 
that this difference has a predominantly cultural basis, however, is 
suggested by several facts. \ luis male superiority is more pronounced 
and consistent in tests depending upon mechanical infoimation than 
in the more abstract tests of spatial relations, which may be equally 
unfamiliar to both sexes. Moreover, male superiority in this area is 
not evidenced as early as was the female superiority in \crbal apti- 
tude. For example, in the extensive observations by Cicscll and his 
associates at Yale, no significant or consi^tent sex differences were 
found during the first five years of life in tests involving block build- 
ing, form boards, and form recognition (25). On the other hand, in 
a study of 100 children between the ages of 2 and 4 with the Wallin 
Peg Board, Goodenough (29) reports a difference in favor of boys. 
In this lest, the subject is required to insert round, square, and tri- 
angular pegs into the appropriately shaped holes as rapidly as possible. 
The bo>s obtained a significantly higher average on this test, despite 
the fact that in the same group the gills excelled significantly in Kuhl- 
mann-Binet IQ. In the light of the negative findings in the Gesell 
observations, it is likely that the sex differences reported on such 
isolated tests may result from differences in the play experiences of 
the particular groups of bo>s and girls studied. 

Among children of scrool age, a clear-cut sex differentiation on 
mechanical tests is already apparent. On the Stanford-Bincl, boys 
were found to excel significantly in block counting from pictures, 
directional orientation, and plan search, all of which piobably 
involve spatial abilities (59). On such tests as form boards, puzzle 
boxes, assembling objects, and slot mazes, boys also score much 
higher than girls in both >pecd and accuracy. A similar male supeii- 
ority w^as found by Portcus (68) in hL graded papcr-and-pencil 
mazes. Boys clearly excelled on theve mazes, when compared with 
girls of the same Stanford-Binet IQ’s. 

Interesting sex differences at two age and educational levels were 
reported in connection* with the standardization of the Minnesota 
Mechanical Aptitude Tests (66). Seventh grade boys and girls, as 
well as college sophomores, were employed in these comparisons. In 
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Table 38 will be found the critical ratios of the differences between 
male and female averages on each of the tests in the battery. The 
largest and most consistent sex difference is noted on the Assembly 
Test, which requires the assembling of a number of common objects, 
such as a bottle stopper or a spark plug, from the given parts. The 
greater experience boys with mechanical objects undoubtedly gives 
them an advantage on such a test. The Paper Form Ik^ard Test, in- 
volving more abstract spatial visualization, shows a male superiority 
which falls short of that required at the .01 level of confidence. I hc 
Spatial Relations I'cst calls for the insertion of numerous irregularly 
shaped pieces in their appropriate recesses as rapidly as possible. This 
test, together with Block Packing and Card Starting, favors girls in 
accordance with the commonly reported sex difference in manual dex- 
terity and perceptual discrimination. It should be noted, however, that 
the female advantage on these tests disappears in the college sampling, 
either because of selective factors or because of intervening experi- 
ential differences. 

TABLE 38 Critical Ratios of the Differenres between Male and Female 
Averages on the Minnesout McLhanical Apntiuie 1 e\ts 
U'loni I'.ititsou tl it! , »io, ji JT-*) 


Test 

( lineal RiiUo 

t uUcttc SophonvHC'i 

Seventh (iiadc Biipils 

Assembly 

12. 1 

10.4 

Paper 1 orm Board 

2 0 

2 4 

Spatial Relations 

-3 2 

2.4 

Block Backing 

- s 0 

1 4 

Caid Sortnig 

-8.9 

-0.6 


In ihis tabic, u minus Mgn nidicales a diflcicnce m favor of the girls. 


On tests of mechanical comprehension, women score lower than 
men, as would be expected on the basis of sex differences in mechan- 
ical information and experience. This is illustrated by the results of 
a survey with the Bennett Test of Mechanical Comprehension (Form 
AA), given to 390 females and 338 males of comparable age and 
education, including high school and adult groups (6). The males 
averaged much higher than the females on this test, the critical ratios 
of the differences ranging from 7.2 to 10.5 in different groups. Al- 
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though the sex differenee varied considerably from problem to prob- 
lem, the females as a group made more errors than the males on 
every item. 

Mention may also be made of sex differences on performance tests 
of ‘‘intelligence,” many of which depend largely upon spatial rather 
than verbal aptitudes. Such tests generally favor the boys. For ex- 
ample, in the complete sampling of Scottish children previously de- 
scribed, a battery of eight performance tests, selected from well-known 
intelligence scales, was administered in addition to the Stanford-Binet 
(55). The total performance score showed a significant difference in 
favor of the boys, the critical ratio of the differences being 3.74. 

On numerical tests, the largest differences are again in favor of 
boys. Such a male advantage fails to appear, however, until the chil- 
dren are well into the elementary senool period. Geseirs observations 
on preschool children show cither negligible sex differences or a 
slight superiority of girls in the early development of numerical con- 
cepts (25 ). Extensive surveys on kindergarten and first grade children 
have also yielded no significant sex difference in arithmetic abilities 
(11, 99). At the lower levels of the Stanford-Binet, sex differences 
on tests involving counting and number concepts arc likewise negli- 
gible or inconsistent (86). 

Among elementary school children as well as older subjects, com- 
putation tests show cither no sex difference or, more often, a dif- 
ference in favor of girls (86). On arithmetic problems and other 
numerical reasoning tests, males excel quite consistently (86). In the 
1937 Stanford-Binet (59), boys excel significantly on the tests of 
arithmetic reasoning, ingenuity (a more dilTicult type of numerical 
reasoning problem), and induction (in which a generalized numer- 
ical rule must be found). In the previously cited studies with the 
Pressey Group Test of Intelligence (9, 69), the boys excelled on the 
arithmetic reasoning test at each age in the elementary school group, 
as well as in the high school senior group. On the Army Alpha given to 
834 high school students, the boys excelled significantly in only three 
tests: arithmetic reasoning, number series completion, and informa- 
tion (95). Ihc differences on these three tests were sufficient to pull 
up the total scores and produce a difference in favor of the boys on 
the scale as a whole. 

The scores of college freshmen reproduced in Table 37 show a 
highly significant difference in favor of males in the mathematical part 
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of the Scholastic Aptitude Test (10). The critical ratio of this dif- 
ference is over 15, indicating virtually complete certainty that the 
diiTerencc could not have arisen from sampling llucluations. Similar 
differences in favor of males were found in surveys with the American 
Council Psychological Examination (ACE) on American Negro stu- 
dents (13) and on white, Chinese, Japanese, and part-Hawaiian high 
school graduates tested in Hawaii (52). In all these groups, the 
arithmetic test yielded significant and consistent differences in favor 
of the males. The annual ACE norms for American colleges also show 
a significant difference in favor of males in total Q-score, based upon 
arithmetic reasoning, number series completion, and figure analogies. 
The first two tests arc numerical and the third spatial in content, in 
the 1947 norms, based on 30,924 males and 24,918 females, the 
mean Q-scorc was 44.39 for males and 41.40 for females. The critical 
ratio ^ of this diff erence is 30.92. 

An interesting recent development in the study of sex differences 
has involved the comparison of boys and girls on the Chicago tests 
of Primary Menial Ahiliiics, designed by Thurstonc on the basis of 
factor analysis (cf. Ch. 15), In one survey (38) on eighth and ninth 
grade pupils, the girls excelled significantly in wx>rd llucucy. icason- 
ing, and visual memory, while the boys were significantly superior in 
spatial orientation. No other sex differences were found to be clearly 
and consistently significant at both grade levels. Jn another study ( 35 ), 
tlie same tests were given to all 13-year-olds in a small rnidvvcslcrn 
town. This group included a total of 4i) boys and 51 giils, and ranged 
from the fourth to the ninth sclu'iol grades, although most of the 
children were in tlie eighth grade. The spatial test w'as again the only 
test on which a diiTerencc approaching statistical significance favored 
the boys. 'The girls weic significantly superior on the reasoning and 
number tests. They also excelled in word fluency and visual memory, 
but the slgnificajice of these differences was low^ The two sexes were 
practically ccjiial on the verba! comprehension test. 

It is probably premature to draw any conclusions regarding sex 
differences on these tests of “primary mental abilities,” but on the 
whole the findings corroborate those obtained with other tests. The 
boys arc significantly superior on the spatial aptitude tests. The girls 
excel consistently in word fluency and visual memory The number 
test, which also favors the girls, consists exclusively of arithmetic com- 
Compiiled by the writers from duta given in 89, p. 14. 
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putation, in which girls have previously been found to excel. Similarly, 
female superiority in the reasoning test may possibly be related to 
the fact that the test consists of letter series to be completed. Girls 
may have more facility witli this type of material, in connection with 
their general linguistic superiority. On the other hand, the girls’ 
failure to excel in the verbal comprehension test may be due to the 
contribution of general information to performance on this test. Since 
boys usually excel in general information, this could counieract the 
girls’ superiority in verbal ability. 

Finally, we may consider briefly certain sex dilTercnces in artistic 
abilities. Among preschool children, girls generally include more de- 
tails in their drawings than do boys (25). d his is true in their spon- 
taneous drawings as well as in various controlled drawing tests, such 
as drawing a man or completing ligures. One reason for such a dif- 
ference nuiy he the possibility that girls are more observant ol details, 
another that they spend more time in drawing during early childhood. 
Both of these difTerenecs could in turn result from the fact that the 
activities of girls are traditionally more sedcntaiy and more circum- 
scribed than those of boys. Girls would thus be more likely to notice 
minute details in their surroundings and would also have more 
practice in such relatively sedentary pursuits as drawing. In later 
childhood and adulthood, sex dilTcrcnccs in artistic production or 
appreciation are even more dilTicult to evaluate because of obvious 
diirercnces in relevant training and experience. On such tests of 
art appreciation as the McAdory and the Meier Art Judgment Test, 
women exceed men in c. .erage scores by small but fairly significant 
amounts (20). 

Jn the Seashore Tcsls of Musi- 1 Talent, which measure relatively 
simple auditory discrnninaticm a.,d memory, no sigrfdicant sex dif- 
ferences have been iound (21). On more complex tests, placing 
greater emphasis ui)on aesthetic appreciation, the scores generally 
fa\'or vv(U 7 icn. An inferesiing clue to the probable origin of some of 
these dilTcrcnccs is pnivided by an investigation on college students 
wdth the Kwalw^asser-Dykema mi-': tests (26). Comparisons were 
made belw'ecn men and women within a group of 1000 students in 
twx'lvc eastern colleges. In tlie total undin’ercntialed groups, women 
excelled in average score. But when only subjects who had received 
no musical training were compared, the sex di^Tcrence disappeared. 
These findings thus suggest that the sex differences ordinarily reported 
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on such tests may result from differential amounts of training re- 
ceived by the two sexes. 

SCHOOL ACHIEVEMENT 

On the whole, girls excel in general school achievement, as revealed 
both by achievement test results and by school grades, l^erformancc 
on the separate parts of standardized achievement tests, however, 
shows a hierarchy of abilities in different school subjects which cor- 
responds closely to that found with tests of intelligence and special 
aptitudes. Corresponding sex differences have been found in the ex- 
pressed preferences for difierent school subjects among elementary 
and high school students (86, pp. 964-966). The same hierarchy of 
academic achievement has been reported consistently from the elemen- 
tary school (37) to high school (43) and college (79), and from 
morons (67) to gifted children (85). On such tests as the Stanford 
Achievement battery, given to many thousands of elementary school 
children in a number of independent surveys, the boys excelled in 
arithmetic reasoning, nature study, science, and history; the girls, in 
reading, language usage, spelling, and arithmetic computation. These 
sex differences in achievement persist when boys and girls arc equated 
in general intelligence. For example, in an investigation on high 
school students, 410 boys and 349 girls were given both the Terman 
Group Test of Intelligence and a standardized geometry test (93). 
I'hc average geometry test score of the boys was higher than that of 
the girls, when the two sexes were equated in intelligence lest score. 

Some of the largest sex differences in achievement test scores have 
been reported on science tests. In a survey (43) of North Carolina 
high school seniors, including approximately 8000 boys and 11,000 
girls, the critical ratio of the difference in favor of the boys was 31.7 
on the science section of the achievement test employed.-’ Similarly, 
in a college survey (48) on 2992 men and 1410 women, the mean 
difference in natural science scores was about 24 times as large as 
its standard error. In the annual Science Talent Search, sponsored 
by Westinghouse, boys again averaged higher than girls (18). 

In summary, girls surpass boys in those school subjects depending 
largely upon verbal ability, memory, and perceptual sp xd. Boys excel 

^ One of the reasons for this exceptionally large critical ratio is, of course, the size 
of the samplings. 
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in those subjects which call into play numerical reasoning and spatial 
aptitudes, as well as in certain 'information” subjects such as history, 
geography, and general science. This is in agreement with the common 
superiority of boys on tests of general information included in intel- 
ligence scales, and probably results from the less restricted and more 
heterogeneous environment to which boys arc exposed, as well as 
from their wider range of reading interests, i’erman (85), for ex- 
ample, in his survey of the reading habits of gifted children, reports 
that the girls read imaginative and emotional liction as well as stories 
of school and home life far more often than the boys, while the latter 
showed a predominant interest in books on science, history, biogra- 
phy, travel, and iriiormational fiction and adventure talcs. 

In regard to school prof^rcss. girls are consistently more successful 
than boys. The difTcrenccs, although small, appear irrespective of the 
particular criterion of school progress employed (51, 52, 71). Girls 
are less frequently retarded, more frequently accelerated, and pro- 
moted in larger numbers than hoys. Typical results from a survey 
conducted in the schools of 318 cities are shown in Table 39. Since 
girls make more rapid progress than boys in school promotions and 

TABLE 39 Median Pcrccntap^c of Hoys and Girls in Normal A^e-Cirade 
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since most comparisons of achicsLiucnt arc made on the basis of 
school grade, it follows that age comparisons would show an even 
greater superiority of girls. 

In school grades girls excel ccmsistently, even in those subjects 
which favor hoys. Thus a comparison of grades in arithmetic, or his- 
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tory, or any other subject in which boys obtain higher achievement 
test scores, shows a sex dilTcrcnce in favor of girls. The advantage 
enjoyed by girls in school grades was made particularly vivid in an 
investigation (50) on 202 boys and 188 girls in grades 2 to 6, all of 
whom were given the Stanford Achievement Test. The girls were found 
to excel consistently in school grades, ^'hen compared with boys re- 
ceiving the same achievement test scores. Thus the grades showed 
a far greater female superiority than seemed to be warranted by 
performance on objective achievement icsts. 

Similarly, high school girls generally obtain better grades than high 
school boys, even though the latter are a more select group and make 
a better showing on achievement tests (cf. 53). 1 his is illustrated by 
a survey of the grades given to students in each of the four years 
of a single high school, the results of which are shown in Table 40. 
It will be noted that in each ^ear, without a single exception, the 
percentage of A’s and B's is larger and the percentage of D’s and 
F’s is smaller among tlic girls than among the boys. The larger per- 
centage of boys than girls who left school turthcr suggests the better 
adjustment of girls to the school situation. 
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V'arions explanations have been olTcrcd for the greater academic 
success of girls. Among the major factors may be mentioned girls’ 
demonstrated superiority in linguistic aptitude, which probably plays 
an important part in nearly all school subjects. Current methods of 
instruction, as well as metlu)ds of testing, are predominantly verbal. 
The child who expresses himself well, furthermore, w'ill impress the 
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teacher as being relatively brighter than one who is linguistically 
backward, and this may in turn affect their respective grades. Another 
possible factor in the higher academic ratings of girls is the neatness 
and general superiority of their handwritnu^. In most investigations 
on both elementary and high school groups, girls have been found 
to excel markedly in the quality of their handwriting, as judged by 
standardized product scales (cf, 51, pp. 72 77). Such a didercnce 
may well affect the grades on school examinations as well as on 
written assignments. 

Owing to the obvicuis presence of a subjective element in school 
grades, it is probable dial pei soiiallty difji') oicc^ between boys and 
girls also inllucnce the allotment of such giades. The importance of 
this factor has been emphasized by several investigtitors (53, 71). 
Ciirls are generally nu^re docile, quieter, not so subject to out-of- 
schcKd distractioiis, less resistant to school discipline, and are less 
often “behavior problems'’ than boys. This difference in the child's 
attitude toward school affects his grades both through the amount 
of material actually learned and, more directly, through the general 
impression created on tlie teacher. 

1N1KRI:S7S, RRld I RLNC'l S, AND AT TITUDFS 

That definite personality differences exist between adult men and 
women in our society is clearly apparent from everyday observation. 
Jn many emotional and s(>cial characteristics, this differentiation is 
noticeable from an ea.iy age. An important aspect of personality 
developiucnt in which traditional sex differences are manifested in- 
cludes interests, preferences, id. ds, attitudes, and persona! sense of 
values, dhese chanicteristics, because of their relatively subtle and 
persistent nature, ollen exert an unsuspected influence, not only upon 
the development of emotional and character traits, but also upon the 
individual's achievements and effective abilities. 

Data on sex differences in interests and attitudes arc available 
from a wide variety of sources. I ‘ivccialiy plentiful is the information 
gathered on children (cf. 86). The preferences of boys and girls have 
been compared in such areas as play activities, spontaneous drawings, 
the choice of topics for written compositions, collections, reading, 
movies, radio programs, favorite characters in fiction or in public 
life, vocational choices, and general life goals. Fairly clear-cut and 
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consistent male and female interest patterns have emerged from these 
varied studies. 

A few typical investigations will serve to illustrate these findings. 
In their study of the play activities of 554 gifted and 474 unsclectcd 
children, Terman et al. (85)' computed a “masculinity index” for 
each of 90 common plays, games, and activities of childhood. For 
each activity, this index was based upon the relative knowledge, 
interest, and participation of boys and girls. Among the most “mas- 
culine” activities in the scale are listed: Uk)Is, shooting, kites, bicycling, 
marbles, wrestling, boxing, football, tops, machinery, baseball, and 
fishing. At the extreme of “feminine” activity arc: dolls, dressing up, 
hopscotch, cooking, playing house, playing school, knitting or cro- 
cheting, dancing, sewing, playing store. In what is probably the most 
extensive collection of data on children's play activities, L.ehrnan and 
Witty (49 ) questioned approximately 17,000 urban and 2000 rural 
children. In general, they found that boys engage more often in 
active vigorous play, in activities involving muscular dexterity and 
skill, and in highly organized and competitive games. The play of 
girls tended to be more sedentary, conservative, and restrained in 
range of action. Observations on children in kindergatten and the 
primary grades have shown that boys devote much more time to play- 
ing with building material, gids to painting and modeling (22). 

A number of surveys hstvc been conducted on sex din’crenccs in 
expressed reading pteferem es^ as well as in the books actually used 
in libraries. In order to avoid the possible effects of the child's read- 
ing skill and of his previous familiarity with certain books, some in- 
vestigators have employed lists of fictitious book titles. In one such 
study ((S8), about 200 children in grades 6 to 8 were asked to indi- 
cate their interest in 80 annotated book titles. The titles making the 
strongest appeal to the boys were concerned with violent adventure, 
travel, exploration, stories about boys, and biographies of men. For 
girls, the most popular subjects were love and romance, mild adven- 
ture stories with a child hero, descriptions of specifically feminine 
activities, and biographies of women. Sports on the whole interested 
boys more than girls, although the sex difference depended in part 
upon the specllic sport. Girls chose stories about boys somewhat 
more often than boys chose stories about girls. Such rr ading interests 
reflect differences in the relative maturity of boys and girls, as well 
as more enduring sex differences in general interests. In a similar 
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survey conducted on Swiss school children (4), the principal in- 
terest of boys was the adventure story, that of girls the family story 
and biography. These sex ditfcrcnces in reading interests have been 
closely corroborated by extensive studies of the fnovie (62) and 
radio (19) preferences of boys and girls. 

The vocational choices reported by children and adolescents reflect 
a similar dichotomy of interests between boys and girls. In one ques- 
tionnaire survey (61), high school students were asked about their 
vocational preferences in terms of seven general job characteristics, 
rather than in terms of the traditional job classifications. Significant 
sex differences v/ere found in each of the seven categories, the critical 
ratios ranging from 4.00 to 12.13 (cf. 86, p. 967). A greater per- 
centage of girls expressed a preference for work entailing little re- 
sponsibility, ccmducted indoors, and dealing with people rather than 
things. The boys more often chose work which involved: calmness 
rather than enthusiasm; risk or discomfort, but compensated by higher 
pay; planning versus carrying out another's plans; and directing versus 
following. It should be noted, of course, that such vocational prefer- 
ences may merely reflect the student^s realization of sex differences in 
vocational opportunities, ralher than being a genuine expression of 
personal interest. In a comparison of the "'areas of life concern'* 
which high school students ranked highest for discussion and reading 
in school, certain sex differences were found which increased in late 
adolescence (82). The boys gave the highest ranks to discussions of 
physical health, safety, and money, and showed a more openly ex- 
pressed interest in sex. 1 he girls were more concerned about personal 
attractiveness, personal philosophy, planning the daily schedule, men- 
tal health, manners, personal qt/aliiies, and home and family rela- 
tionships. 

Investigations conducted on adult groups by a variety of techniques 
reveal similar sex ditfercnces in interests and attitudes. A number of 
investigators have analyzed the conversations of men and w^omcn by 
a method which may be unceremoniously described as “eavesdrop- 
ping.'’ Observers systematically :‘"c»^rded the topics of conversation 
overheard in New York’s theatre district (63), on a midwestern col- 
lege campus, in churches, hotel lobbies, streetcars, and other public 
places (47). A similar survey was made on two busy London streets 
(46). In another survey, three observers tallied the topics of con- 
versation overheard during the ten-minute intermission at nineteen 
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concerts, over a six-month period (15). The results of this study, 
based on 601 samples of conversation among adults, were typical 
of those found by all the other investigators. Although the locale 
does to a certain extent determine the topics of conversation, the 
principal sex differences are quite consistent. Money, business affairs, 
and sports are more common in conversations between men; other 
women and clothes are more common in conversations between 
women. Moreover, women converse to a significantly greater degree 
than men about people. 1 he conversations of mixed groups tend to be 
dominated by topics either of equal interest to both sexes or of little 
interest to either. 

In a typical survey (12) on newspaper readinff^ the investigators 
found much in common between the reading interests of men and 
women. In reference to such differences as they did find, however, 
the authors concluded that women “tend to slight the things about 
news that matter in the social sense, and are most interested in the 
commonplace, ephemeral and human-interest sides of life.” Sex dif- 
ferences in interest for different types of activity are indicated by such 
interest tests as the Kuder Preference Record (91, 92), designed 
especially as measures of vocational preferences, On^the average, 
males show stronger preferences for mechanical, persuasive, com- 
putational, and scientific work. Female averages indic.ite greater in- 
terest in the literary, musical, artistic, social service, and clerical 
areas. Similar differences between men and women in general have 
been found on the Strong Vocational Interest Blank (81). On the 
other hand, groups of men and women engaged in the same occupa- 
tions showed very similar interest patterns (77). Thus women physi- 
cians or life insurance saleswomen resembled men physicians or life 
insurance salesmen much more closely in their interests than they did 
housewives. 

Significant sex differences have been obtained on the Allport- 
Vernon Study of Values (14). A psychograph conslrueted from the 
average scores of 1 161 men and 1592 women is given in Figure 91. 
Women's responses rate highest in the aesthetic, social, and religious 
values. This suggests that the immediate enjoyment of artistic experi- 
ences, a concern for the welfare of other people, and an emphasis 
upon spiritual values may be relatively important in ^hc life goals of 
women. The men’s psychograph shows peaks in the theoretical, cco- 
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nomic, and political values. Such a profile indicates an interest in 
abstract knowledge and understanding, a drive for practical success, 
and a desire for prestige and power over others. These sex dif- 
ferences are not, however, very large, and the overlapping is con- 


60 


Significantly J 50 
high 1 


Scores from 
81-28 are 
average 


Significantly j 
low 


40 


30 


20 


10 



00 


50 


40 


30 


20 


10 


Tiico- 

retic<il ! 

1 

Eco- 

nomic 

thetic 

Social 

Polit- 

ical 


Relig- 

ious 


■ Men(N = 1163) ----- Women (N=: 1592) 

Fig. 91. C ompo^le I’loliiC'. ol Aoult Men anti Women on the Allport- 
Vernon Stud) of X'aloos. (Data Irom C .inirii and Allporl. 14, p. 2('>().) 

siderablc. Far larger dilTerenccs on this test have been IoiukI between 
different occupational grtaips of the ^an^e sex, than bel\\'ecn men 
and women in general. Averages as low as 21 and as liigh as 49 
have been obtained by men in ci/'.rcnl occupational groups, while 
the averages of men or of women as a grou[> do not drop below 27 
nor rise above 33 on any one value score. As in the case of the 
Strong Vocational Interest Te^l, oeeupalion seems to introduce more 
of a difference in thee scores than docs sex. 
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SOCIAL AND EMOTIONAL CHARACTERISTICS 

Social Adjustment. Certain consistent sex differences have been re- 
ported in the adjustment to social mores and restrictions, as well as 
in other aspects of personality commonly designated as character 
traits. In an extensive scries of tests by Hartshornc, May, and Shuttle- 
worth (33) on approximately 850 elementary school children in three 
cities, significant differences in favor of the girls were found in moral 
knowledge and social atiiiiides. Several tests of each of these aspects 
of character development were employed. In order to keep as close 
as possible to the children’s own opinion, the tests were worked up 
in the form of ballots and the children were asked to '‘vote'’ on each 
item. In the so-called duties test, for example, several propositions 
were given with the request that the subject indicate whether it is his 
duty to do these things, by underlining Yes, No, or S (sometimes 
yes and sometimes no). Some of the items in this test were as follows 
(33, pp. 46-47): 

1. To help a slow or dull child with his lessons Yes S No 

2 To call your lcachcr\ attention to the fact if you ^ 

received a higher giadc than you deserved Yes S No 

3. To smile when things go wrong Yes S No 

4. To report another pupil if you see him cheating Yes S No 

In total scores on both the moral judgment and social attitudes tests, 
the differences in favor of the girls were 4.31 times as large as their 
standard errors and can tliercfore be regarded as highly significant. The 
investigators concluded that: “It appears on the surface at least that 
girls arc more sensitive to both conventional and ideal social stand- 
ards than boys” (33, p. 1 19). 

Significant sex differences were also discovered in certain objective 
behavioral tests of character. In a series of investigations by Harts- 
horne. May, and Mailer (31, 32), tests were devised in the follow- 
ing areas. ‘V/ccr//,” including cheating, lying, and stealing; "‘service,” 
including cooperative and charitable behavior; and ‘‘self-control ” 
including persistence and inhibition. Among the special advantages 
of these tests may be mentioned the fact that the subjects did not 
realize that they were being tested or that their actions could be 
detected. All observations, furthermore, were made in the course of 
ordinary everyday activities of the children, including school work, 
homework assignments, athletics, and party games. Data on deceit 
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were collected on some 10,865 elementary school pupils in several 
parts of the country. For the main studies on service and self- 
control, about 900 children were employed. 

TABLE 41 Sc.x Dtffo cticcs in Co tain Character 
T I aits 


lAdnpUd fuaii llaitshonu, and Mallti, 

pp I'io. JSO, 382) 


M I'lisitrc li nip toyed 

Dill/' j ci,ii ff 

Tot 111 service score 

1.9 

RepuliUion for service 

7.9 

'Ibtal persistence score 

1.7 

Reputation for persistence 

7.6 

Total inhibition score 

5.5 

Reputation for inhibition 

5.0 


All clifltM elites liivoi the giils. 


No consistent sex difference in deceptive behavior was found. 
Analysis of separate tests showed that boys tended to he more honest 
in some siiiuUions, girls in others. In the studies of service and self- 
control, sex comparisons were made in both test scores and “reputa- 
tion'’ among classmates and teachers. Summary data on sex differ- 
ences in tliese tests arc given in Table 41. It will be noted that all 
of these differences favor the girls. In service and persistence, how- 
ever, the differences in 'otal scores were not significant. The relative 
standing of the two sexes in these areas also varied from one test to 
another. Persistence scores depended largely upon the appeal of the 
specific subject mattei for boy^ or girls. Other investigators have 
also obtained conllicfing results with persistence tests. In inhibition, 
on the oilier hand, girls were significantly superior in total score and 
consistently superior on each separate test, d'he more successful adiust- 
ment of girls to the scliool situation may be partly the result of such 
a personality difference. 

It is also interesting to note in this connection that in reputation 
the girls excel the boys markedly in all traits. This too may intiuence 
their school success. The discrepancy between reputation and per- 
formance is likewise of interest in relation to social pressure. It may 
be that with increasing age the cumulative force of social expectancy 
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becomes more effective and the discrepancy between behavior and 
traditional belief is lessened. With this would come an increasing 
differentiation between the sexes. Until similar behavior tests are 
made on adult subjects, these questions cannot be answered. 

Another source of data on social adjustment is provided by sta- 
tistics on crime and delincjuency. Such records must, of course, be 
interpreted with considerable caution, since opportunities for crime 
are very different for the two sexes. Moreover, the dillcrential treat- 
ment of the two sexes by the courts is clearly apparent. For most 
crimes, the available statistics probably underestimate the frequency 
of occurrence among W'omen. The one exception is sex delinquency, 
which is judged with less leniency for women than for men. Whatever 
the reasons, however, the discrepancy betw^een the crime records of 
the two sexes is tremendous. During a typical year, the men sent to 
federal and state prisons and reformaUnies outnumbered the w'omen 
in the ratio of nearly 25; I (cf. 73, p, 245). A similar ratio was found 
between ma!c and female convictions in New York State within a 
one-year period. But when the number of arrests was considered, 
the ratio dropped to 19:1 (73, p. 248), Ihe latter finding illustrates 
the differential ticatment of men and women by the cowls. Statislics 
on juvenile delinquency vary widely from one report to another, 
ow'ing to such Fictors as tlie criterion of delinquency and dilTercnces 
in local conditions and practices. All agree, liowever, in showing a 
much greater proportion of delinquent boys than girls (<S()). 

A similar excess ol boys is found among the children icfcricd to 
child giiidance clinics as hehavior pjohlcms (1). Additional data are 
furnished by a number of extensive school survc\'s in wliich teachers 
WTTe asked to supply information about the problem children in their 
classes. The results of all these surveys are in eli^se agreement (86). 
Tn one investigaiion covering ten cities, the ratio of boys to girls in 
the problem group was 4:1 (97), Among the undesirable types of 
behavior reported much more frequently for boys Ilian for girls arc: 
truancy, dcsiruclion of property, stealing, profanity, disobedience, 
defiance, cruelty, bullying, and rudeness (96). Moreover, a larger 
number of undesirable behavior manifeslations per child arc reported 
for boys than for girls (96). To what extent these sex dilTcrenccs 
may be a reflection of teachers’ attitudes toward boys and girls, and 
to what extent they represent real behavior dilTcrenccs, is dilficult 
to determine. That the differences arc at least partly the result of a 
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“sex halo” in teachers’ ratings is suggested by a number of investiga- 
tions of such ratings (cf. 86, pp. 987-989). 

Emoliuiial Adjustment. 'The greater frequency of behavior prob- 
lems among boys is probably related to a more general sex difference 
in c and dot}}ina}it hr/un ior. The origins of this sex diifcrence 

are probably partly ciilliiral and partly biological. Ti will be recalled, 
lor example, that a similar sex difference in aggressive and pugna- 
cious behavior has hcen observed in many species of animals. The 
greater si/e and muscular strength of the male is undoubtedly one 
contributing factor, and the male sex hormone is another. The part 
piaved by the latter is demonstrated by tlie conspicuous changes in 
aggressive behavior following gonadal transplants in animals. 

Whatever its origins, this sex dilference is a particularly persistent 
one in our culture, having been observed, from early childhood to 
adulthood. Studies on nursery school groups have repeatedly demon- 
strated that boys display anger and aggression more often than girls. 
In one investigation (34). lor example, independent raling-- by three 
teachers were obtained tor each of 579 nursery school children. The 
lesLilts indicated that boys more often grab toys, attack others, rush 
into danger, rcfiise to ccMnply, ignore requests, and laugh, squeal, and 
jump around excessively. Cnrls. on the other hand, more frequently 
exhibit withdrawing and introverted behavior, such as avoiding play, 
staying near an adult, seeking praise, and giving in too easily. A 
number of these ditfcienecs were of questionable statistical signifi- 
cance, the critical ratios being under 2.58, At the same time, most of 
the observed sex dillerences were revealed as clearly among 2-year- 
olds as among 4-ycar-olds, a fact which led the author to minimize the 
role of social pressure in the greater aggression of boys. Direct 
observations of preschool children in standardized experimental situa- 
tions have likewise shown a greater frequency of aggressive re- 
sponses among boys (64). Quarrels with other children in kinder- 
garten and elementary school arc also more common among boys 
than girls (cf. 86). 

The administration of persona!** \ tests of the que.stionnaire type 
has indicated similar sex dilTcrences in aggression or dominance 
among students and unsclectcd adults. On the Bernreuter Personality 
Inventory, for example, the average male score in dominance has 
been found to be significantly higher than the female average in high 
school, college, and older adult groups. This test, consisting of 125 



672 


Differential Psychology 


questions, can be scored with six different keys for as many different 
traits.^ The average scores obtained by male and female college 
groups, together with the critical ratios of the differences between 
them, are given in Table 42. The number of cases in these groups 
varied from 144 to 658. It will be noted that in the dominance scale 
the critical ratio of ihe difference in favor of males is 3.77. 


TABLE 42 Sex Differences among College Stiidenis on the BernreuUr 
Personality Inventory 

(iKita from Urtiiieiitci, 7) 


Scale 

Male 

A vet arc 

7 cmale 
A\cta^:e 


ff. Direction oj Difference 

BiN: Ncuroticism 

57.3 

”42 8 

3.15 

Women more neurotic 

BjS: Sclf-sLiniciency 

27.0 

6.8 

5.89 

Men moie self-sufficient 

Bd: Introversion 

”25 6 

-14.7 

3.50 

Women moic introverted 

BiD: Doini nance 

45.9 

30.6 

3.77 

Men moic dommuiit 

FjC: ('onfidcnce 

”51.5 

8.7 

9.62 

Men more sclf-tonficlcnt 

Fi-S: Sociability 

”25.9 

-31.1 

0.88 

Women more gregarious 
and socLilly dependent 


Another area of behavior showing large and persistent sex dif- 
ferences is that of social orientation. Some evidence has already been 
presented — in our discussiqn of interests, preferences, and attitudes — 
which indicates a much stronger interest in people among women than 
among men. This sex difference also appears early in life and continues 
into old age. One possible factor in the greater social interest and 
social orientation of girls may be their earlier language develop- 
ment. Their more rapid mastery of speech would certainly give girls 
an advantage in communicating with other children as well as with 
adults, and would thus encourage activities of a social nature. Of 
prime importance, however, are the subtle social pressures which 
probably begin to operate much earlier than is generally realized. 
Traditional sex roles and sex stereotypes arc almost certain to be 
rcllectcd in the attitudes of parents and others toward the child 
almost from the time of his birth. 

^“The fact that some of these traits have been shown by factor iinalysis to be 
jiorrelatecJ with each other, and that the last two icpiesent common faclois identi- 
fied through the hist four, introduces uniiecessaiy duplication in l^.e scoics, but does 
not invalidate the sex compaiisons made. The six scales simply leprcscnt six 
categories into which ihc responses can be grouped. 

“Computed by the writers from the iioims published by Bernreuter (7). 
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Throughout childhood, sex differences in sociality have been noted 
in a wide variety of situations (42, 86). In the play activities of 
nursery school children, boys show more concern with things, girls 
with personal relationships. Similarly, girls manifest more responsi- 
bility and “motherly behavior’' toward other children than do boys. 
At all ages, girls engage more often in “social” games involving other 
children; they read more books about people and more frequently 
express interest in occupations dealing with people. The girls’ greater 
concern with questions of appearance and manners is indirectly an 
indication of more interest in what others will think of them. Parents’ 
tabulations of the questions which children asked in their presence 
showed a signiticantly greater proportion of questions about social 
relations asked by girls. Nicknames of an affectionate form are more 
common among girls, those based on physical peculiarities more 
common among boys. Girls arc more frequently angered by situations 
affecting their social prestige, and also experience more jealousy. 
Even studies of children's dreams have shown that girls more often 
than boys dream about people of various sorts, as well as about their 
own family and home. 

In interviews with 666 children between the ages of 5 and 12, 
responses involving persons, family, and social relations occurred 
Vvith consistently greater frequency among the girls than among the 
boys (41), Among the children’s “first wishes,” for example, the 
proportion dealing with siblings, companions, or friends was 12% for 
girls and 3% for boys. In descriptions of “the best thing that ever 
happened” to them, 14.>% of the girls and 8.3% of the boys men- 
tioned parental contacts and other personal relationships. Correspond- 
ing percentages for “the worst diing that ever happened" to them 
were 15.6 and 8.0. At the opposite extreme of age, it is interesting 
to note that among persons between 70 and 90 years old, sociability 
showed a high positive correlation with happiness in females, but an 
insignificant correlation in males (42). 

An important distinction has been made by Johnson and lerman 
(42) in their discussion of sex udNrences in social orientation. Data 
collected on 3000 college students suggested that women do not 
actually behave more socially, although they desire more strongly than 
men to be social, 'fhe overt expression of social interests in the female 
is more often inhibited by timidity and lack of self-confidence, as 
well as by niore specific culturally imposed restrictions. The previ- 
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ously discussed sex difference in aggression probably affects social 
participation. As for self-confidence, a number of studies have indi- 
cated that women rale lower than men in this regard. For example, 
the largest and most highly significant sex difference reported in Fable 
42 was found in the confidence scale of the Bernreutcr Personality 
Inventory. The critical ratio of this difference is 9.62. A similar dil- 
fercnce in favor of males, with a critical ratio of 5.89, was obtained 
in the self-sufficiency scale. On the other hand, the lack of significant 
sex difference in the ‘'sociability"’ scat,' of the same inventory may 
be due to the inclusion of two aspects of socia.bility showing opposite 
sex differences, vi/., social interest and social participation. 

The distinction between social interest and social participation may 
also help to explain some of the ambiguities in the results on sex 
differences in introversion. We have already seen (Cii. !5) that com- 
mon tests ol introversion measure more than one “unitary trait.'' he 
factor analyses of Mosier and the Ouilfords suggested a convenient 
tripartite division into “thinking," “social," and “pubiic" introversion. 
For a comparison ol the sexes, a different sort of distinction may be 
necessary, possibly between social orientation or interest and social 
contacts or participation. I timing again to Table 42,^we find that 
women appear significantly more introverted on the average than men. 
vSurveys with the (iiiilford and Martin inventories on high school 
students and rural adults showed no significant sex diff'eienee in “think- 
ing" introversion (30). “Social" introversion gave inconsistent sex 
differences in the high seliool and rural groups. 

d’he first clear-cut evidence that sex differences may cut across the 
traditional introversion-extroversion category is to be found in a study 
by Heidbreder f36). A list of 5-1 “introvert traits" was carefully 
compiled so as to include the behavior most frequently designated 
as characteristically introverted. Self-ratings and ratings by two asso- 
ciates on each of these traits were obtained for 100 college men and 
100 college women. No significant sex difference was found in total 
introversion scores, the averages being 11.41 and 11.12 for the men 
and women, respectively. But the introvert characteristics reported 
most frequently by the men differed from those reported most fre- 


'“I'liis should not be icgarJcd as independent corroboration, Once the two scales 
ovcilap consitlerabi>. It does suggest, however, that it is the self-conlidcncc items 
which account lot the sex diO’eience, ralhct than some other iinidenti/ied aspect of 
behavior which might have been involved in the lest score. 
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quently by the women. A few examples will serve to illustrate this 
difference. 

lypical "'/nascidinc** symptoms' of introversion: 

Outspoken 

Works things out on own hook; hesitates to accept help 
Keeps in background on social occasions 
Conservative and painstaking in dress 
Introspective 

Typical jcnnninc” symptoms of intros ersion: 

Shrinks when lacing a crisis 
Works by tits and staits 

H as ups and downs in mood without .^npaient cause 
Feeis hurt readily 

Hesitates in aiaking dccisuins ('.n ordinary ma Iters 

Another point should be considered in evaluating sex diflcrcnces 
in introversion among college students, d here is some evidence sug- 
gesting that introversion scores tend to be pc^sitively related to 
academic success among hoys, but negatively related to academic 
success aiinong girls (ef. 42). 1 hose students who have been suf- 
ficiently successrul in their school w'ork to reach c<.dlege arc thus 
likely to show a smaller sex dilTerence in introversion than is found 
among unselecled adults. Moreover, the environment of male and 
female college students is midoubiedty more similar than that of 
un^elected adult men . -d .vomen In >ummary. what the results on 
introversion show is that neither men nor women can be said to be 
more introNcrted. Any sex difict.mees reported in measures of intro- 
version must be inteipreted wif' leferenee to the particular popula- 
tion studied, as well as tlic spcciftc behavior inanijcstations which 
were sampled. Fxisting sex diherenees can evidently be better de- 
scribed in terms of social orientation than in terms o\ the traditional 
intrewersion-extroversion category. 

A third major personality are . hi which large sex dilTcrences have 
been reported is that of etnolioiial instability or ncuroticis/n. Obser- 
vations of prescliool and elementary school ciiildrcn have revealed 
a somewhat greater fretpiency of ‘'nervous habits,” such as nail- 
biting and thumb-sucking, among girls than among boys (cf. 86). 
It should be noted, of course, that “nervous habits” represent a rather 
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arbitrary behavior category. The greater frequency of “behavior prob- 
lems” among boys may balance the greater frequency of “nervous 
habits” among girls. The total degree of instability might thus be no 
different in the two sexes at these age levels. Girls may simply resort 
to milder and less violent ways of expressing displeasure and malad- 



Fig. 92. Median Number of Symploms Reported by Boys and Girls on 
the Woodworth-Mathews lest ot hmolional Instability. (P'roni Mathews, 
56, p. 21.) 

justment than boys, because of differences in socially imposed re- 
strictions. 

Fear responses have also been found to occur more commonly 
among girls, as determined by laboratory investigations, teachers’ 
reports, and interviews with children (34, 41, 86). A ciilUtralJy de- 
termined sex differenee in the admission of fear may exaggerate these 
differences somewhat. Girls also tend to report more worries, as well 
as emotional responses of greater intensity and to a wider variety of 
stimuli than do boys (cS4, 86). 

On neurotic inventories, clear-cut sex differences in emotional 
instability do not appear until the adolescent years. This finding w'as 
corroborated in a number of investigations with adaptations of the 
Woodworth Personal Data Sheet, specially designed for use with 
children and adcffescents (32, 56, 85, 86). An interesting illustration 
of age changes in this respect is shown in Figure 92, based upon the 
scores of 575 boys and 558 girls between the ages of 9 and 19. At 
age 10, the boys reported a larger median number of neurotic symp- 
toms than the girls on the Woodworth-Mathews Test. With increas- 
ing age, the median number of symptoms tends to rise among the 
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girls, but drops among the boys. Beyond age 14, the sex difference 
is statistically significant and consistently in favor of boys. The increas- 
ing differentiation of social pressures with age is one hypothesis that 
is obviously suggested by such data. 

Among adult groups, sex differences on neurotic inventories are 
large and consistent. Again referring to Table 42, we find a critical 
ratio of 3.5 for the sex difference on the neuroticisni scale. Similar 
evidence for greater female emotionality was found in the previously 
cited Guilford and Martin study on high school students and rural 
adults (30). That such differences are real and not limited to ques- 
tionnaire replies is suggested by a study of a college group (17). 
Students who had taken a personality inventory were subsequently 
interviewed by two experienced counselors. The excess of maladjust- 
ment among the women, as revealed by the interviews, was even 
greater than that indicated by ihc test scores. 

In their analysis of the releva.nt literature, Johnson and Terman 
(42) tend to emphasize constitutional rather than cultural factors 
as a basis for the greater emotional imbalance of the female. Among 
the types of evidence which they cite in support of such a conclusion 
are; (1) the early age at which sex differences in nervous habits 
appear; (2) the persistence of sex differences in neuroticism today, 
despite the trend toward equalization of social pressures; (3) the 
fact that sex differences in emotionality are as great or greater among 
institutionalized blind, deaf, or orphaned children, despite the relative 
uniformity of the instituticmal environments; and (4) the fact that 
peaks of “nervous” bciiavior often coincide with such physiological 
changes as puberty and the menopause. 

All these lines of evidence must be interpreted with caution. 
Puberty and the menopause are periods of acute social crises in our 
society, as well as periods of physiological upsets. Institutional envi- 
ronments are far from uniform for boys and girls. In fact, there is 
no reason to suppose that sex stereotypes arc any different among 
institutional personnel than among any other members of our cul- 
ture. The persistence of sex dihc^^^nccs in neuroticisni in contempo- 
rary society is not surprising. The greater equalization of education 
and the sporadic admission of women to certain predominantly “mas- 
culine” occupations, without the removal of other sources of frustra- 
tion and discrimination, may increase rather than decrease conflict 

Cf. dcfinilion of eiivKOTimenl in Chapter 4. 
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imd maladjustment. As for the excess of nervous habits among female 
children, it has already been pointed out that nervous habits may be 
an insufficient index of emotionality. The evidence does not conclu- 
sively show that the female is the more “emotionaf’ sex in childhood, 
even if we were to grant that the environments of boys and girls 
were equated. 

A “masculinity-femininity index” of personality 

An approach to sex differences which has been used increasingly in 
recent years is the comparison of men and women in those responses 
which have proved to be most charaeteristic of each sex in our con- 
temporary culture. Test items are chosen on the basis of their ability 
to discriminate between tlic responses of the sexes. I'hus if 30% of 
the men atid 29% of the women were to report that they like modern 
art, the item would be discarded because it docs not differentiate be- 
tween the sexes. Similarly, if 76%, of the men and 79% of the women 
dislike walking in the rain, this item is also eliminated. Only those 
items marked by a significantly different proportion ^f men and 
women are retained. 'Ihe resulting test provides an index of “mas- 
culinity-femininity'’ in the sense that it rellects the characteristic male 
and female responses in <H‘ir culture. This approach is illustrated by 
the masculinity Temininity scores on such tests as the Strong Voca- 
tional Interest Blank, the Minnesota Multiphasic Personality Inven- 
tory, and the Intere^i-Attiiudc Analysis prepared by 'J erman and 
Miles. 

It should be noted that such tests are deliberately designed so as 
to exaggerate sex differences, in the same way that such intelligence 
tests as the Stanford-Binet arc designed to exclude or minimize sex 
differences. The behavior of men and women undoubtedly shows 
many similarities. These tests, however, concentrate on the differ- 
ences, since it is their purpose to measure the difierenccs between 
men and women as fully as possible. For any person taking such a 
test, the M-F (masculinity-femininity) index indicates the degree to 
which his responses agree with those most characteristic of men or 
of women in our culture. It i> customary to desigr.ate either the 
masculine or the feminine end of the scale arbitrarily as 4~ or —, 
for purposes of quantification. 
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Sex Differences: Major Results 

The most extensive investigation of characteristic sex dilferences in 
personality is that conducted by Terman and Miles (84). After an 
exhaustive survey of the literature and prolonged research, items were 
chosen which revealed the most pronounced dilTerences between rep- 
resentative samplings of men and women in our society. Data were 
gathered on many hundreds of persons, including elementary school, 
high school, college, and graduate students; unselected adults; mem- 
bers of several occupations; and specially selected groups such as 
athletes, juvenile delinquents, and adult homosexuals. I'he Interest- 
Attitude Analysis, constructed as a result of this research, consists of 
seven parts; Word Association, Inkblot Association, Information, 
Emotional and Ethical Attitudes, interests. Opinions, and Introvertivc 
Response. 

This scale proved ^ery successful in dilTcrentiating between the 
responses of male and female gioups. Signilicaiit sex dilTerences in 
total score were obtained at all age levels, from teen-agers to octo- 
genarians. The critical ratios of these dillerences ranged from 7.2 to 
39.9. Overlapping of male and female distributions wars also relati\ely 
slight, rhe test thus achieved its purpose of selecting those beliavior 
characteristics wiiich dilfercntiaie most clearly betw-een the sexes. 

An intensive analysis of the male ana female responses on each 
part of the test brought to light those aspects of the personalities of 
the two sexes which arc most clearly dilTercnliated in our culture, 
d’erman and Miles summarize these dillerences as follows; 

From whatever angle vve have examined them the males included in the 
standardization groups evinecvl a distinctive into^'cst in exploit and ad- 
venture, in outdoor and physically strenuous occupations, in machinery 
and tools, in science, physical p enomena, and inventions; and, from 
rather occasional evidence, in busu css and cs'/mne ree. On the other hand, 
the lemales of our gioups have evinced a distinctive inleiest in domestic 
alTairs and in .'esthetic objects and occupations: they have distinctly pre- 
t erred more sedenl.iry and imtoor occupatit)ns, aiul oeeupations more 
directly minislrative, particularly to the yoi.ng, the helpless, the distressed. 
Supporting and supplementing these arc the more subjective ditiercnces — 
those in emotional disposition and aeeclion. The males directly or indi- 
rectly manifest the greater selt-assei tion and aggressiveness; they express 
more hardihood and fearlessness, and more roughness of manners, lan- 
guage, and sentiments. The females express themselves as more compas- 
sionate and sympathetic, more timid, more fast-dious, and iesthetically 
sensitive, more emotional in general (or at least more expressive of the 
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four emotions considered), severer moralists, yet admit in themselves 
weaknesses in emotional control and (less noticeably) in physique (84, 
pp. 447-448). 

In regard to the origin of such sex differences in personality, there 
are several lines of evidence which suggest the greater role of cul- 
tural than biological influences. One source of relevant data is to be 
found in some of the group profile comparisons reported by Tcrman 
and Miles (84, pp. 570-579). These profiles, showing the sub-test 
averages of various male and female groups, strongly suggest the 
specificity of differences in masculinity-femininity. Groups with the 
same mean total score may achieve such a score in very different 
ways. For example, among the most “masculine’’ groups in terms of 
M-F index are high school boys and engineers. Both obtained iden- 
tical mean total scores, but the high masculinity of the high school 
boys resulted largely from their interests and information, while that 
of the engineers was primarily due to their emotional and ethical 
attitudes. On the latter test, the high school boys were actually more 
feminine than the general male population. Similarly, groups of de- 
linquent girls and of women artists received mean tota^ scores which 
coincided with the norm for the general female population. The 
delinquent girls, however, achieved this result by a very “feminine” 
performance on the test ot emotional and ethical attitudes, £ind a very 
“masculine” performance on the interest test. The women artists, on 
the other hand, were significantly more “feminine” than the general 
female norms in interests, but significantly more “masculine” in infor- 
mation. Men artists, with a similar deviation in the “feminine” direc- 
tion in interests, as well as in the word association test, received a 
much more “feminine” total score than the male norm, although 
their scores on the remaining sub-tests were at the general male 
average. 

Correlations between M-F scores and physical characteristics have 
been generally low and insignificant (84, Ch. V; 27). Such correlations 
as have been found are probably the result of the social effects of 
certain conspicuous physical characteristics, rather than tlie result of 
underlying biological factors. For example, a slight tendency has been 
found for taller men (84) and for men with deeper voices (27) to 
obtain a more masculine M-F index. Such a correlation may simply 
reflect the influence of social stereotypes upon the development of the 
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individual’s personality. Studies of male homosexuals (84, Chs. XI- 
Xlll) have also indicated that experiential rather than structural 
factors were primarily responsible for the development of homosexual 
behavior. Especially important were early home environment and 
parental attitude toward the individual. 

In the general population, the M-F index has been found to be 
significantl)' associated with education and occupation (84). Illus- 
trative data on occupational groups arc shown in Figure It was 
also found that highly intelligent and well-educated women tend to 
score more "‘masculine” than their sex norms. For example, w'omcn 
listed in Who's Who, as well as those holding an M D. or a Ph.D. 
degree, average more "‘masculine” in total score than any of the 
occupational groups shown in Figure 03. vSimilarly, men who have 
cultivated avocational interests of an artistic or cuilural nature tend 
to obtain more “feminine*' scores. Thus the equaii/ing inlluencc of 
specific training or experience seems to bring about a convergence 
of the temperamental qualities of the two sexes. 

The M-F index seems also to depend upon the (lianc\tic uiilicu 
in which the individual was brought up. Such factois as the death 
of one parent, excessive or exclusive association with one or the 
other parent, and predominance of brothers or of sisters among the 
siblings are much more closely related to M-F score tlian are physi- 
cal traits (S4). Moreover, there is some evidence to suggest that 
deviation toward the norm of the opposite sex in both men and waimen 
is associated with unpleasant and undesirable childhood experiences, 
broken homes, and [>arental maladjustment (23, 78). A pleasant, 
happy childhood, on the other hand, encourages the individual to 
accept the appropriate male or female model of behavior presented 
by his culture. 

To summarize, in personality as in intelligence, we cannot speak 
of inferit)rity and superiority, but only of specific differences between 
the sexes. These dilferences arc largely the result (T cuilural and 
other experlenlial factors, although certain physical sex dillercnces 
may influence behavior development, either directly or through their 
social effects. Lastly, the overlapping in all psychological character- 
istics is such that we need to consider men and women as individuals, 
rather than in terms of group stereotypes. These Ih’-ec points will 
prove to be useful “rules” to observe in understanding other group 
differences to be considered in the chapters which follow. 
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Thk COMPARA71VI i\AiuA'ru)N of tlic raccs of man has long been 
a subject of keen iiUeiest and Inely controversy. It an interesting 
commentary upon human thought that nearly all theories of racial 
inequality proclaim llie superiority oi the particular laec of their 
respective exponents.' i hus Aristotle (cf. 46, pp. 31S 320) en- 
deavored to demonstrate that the intellectual leadership of the Greeks 
must ot necessity billow iioin their favorable geographical location. 
He argued that the pec^ples inhabiting the colder regions of n(;rthcrn 
Lurope, although outstanding for bravery and physical prowess, were 
intellectually incapable ol a high degree oi political organization or 
leadership. Similarly, the Asiatics, although intellectually keen and 
inventive, lacked spirit. The Greeks alone, being geographically inter- 
mediate, were endowed wiih the proper balance ot these traits and 
were thus by nature fitted to rule the earth. Similar claims have been 
made for such groups as the Arabians, the Romans, the French, the 
Anglo-Saxon, the "white” race as distinguished from those having a 
different skin pigmentation, the Nordics, the Alpines, the Mediter- 
raneans, and various otlicrs 

Outstanding among such theories, because of its widespread popu- 
larization, is that proposed by de Gobincau (17) in the nineteenth 
century and subsequently expanded by Chamberlain (12). This doc- 
trine had numerous followers who retormulatcd it and developed it 
along various lines. Its basic contention, however, is the superiority 
of the Nordic or “Aryan” race, .. loosely and ambiguoudy defined 
group whose descendants arc now supposed to inhabit for the most 
part the countries of northern Europe. The array of evidence cited in 
support of this theory is incomplete and one-sided at its best and 

^ tor a readable hisioiical {>urvcy of thcoiies of “lacial superiority/' cf. 
Benedict (2). 
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fantastic and mythical at its worst. The concepts involved in such a 
theory will be critically examined in the course of the subsequent 
discussion. 

Within our own generation, race problems have flared up with 
violent intensity and shocking effects. Outworn and forgotten theories 
have been revived in an attempt to rationalize political actions and 
policies. The Nazi racial doctrines during World War II represent a 
recent illustration of such a perversion of anthropological material. 
Probably the first writer to use the data of physical anthropology 
for nationalistic propaganda was de Quatrefages who, during the war 
of 1870, referred to the Germans as “Huns” (cf. 13). The latter 
term was revived as a derogatory epithet during World War I. 

Under the stress of emotional appeal, it is especially diflicult to 
carry on unbiased and objective analysis of facts. It is one of the 
earmarks of prejudice to draw logically unwarranted inferences from 
the data at hand. A typical testing technique for the measurement of the 
prcjudicc-fairmindedncss variable, for example, is based upon just 
such behavior (cf., e.g., 59). The subject is given certain facts bear- 
ing upon controversial issues, with the instructions to check any of 
the proposed conclusions which seem to him to follow Jirritly from 
the given data, regardless of their truth or falsity in general. The 
individual who is biased or who responds emolionaily to any of the 
issues involved will ignore the limitations of the fads actually pre- 
sented and will generalize far beyond tliem. The procedure in this 
test presents a close parallel to what probably occurs all loo often 
in the interpretation of data on such emotionally toned issues as 
race differences. 

Under such conditions it is especially important to recognize and 
to bear clearly in mind the possible vitiating factors and sources of 
error in the data. As in all group comparisons, studies on race dif- 
ferences must take into account selective jacti^rs and adequacy of 
sampling, overlapping of di.stributions, reliability of an obtained dif- 
ference, inaccuracy or enrhiguity of the measuring instriinicnt, and 
other similar factors which have already been discussed and illus- 
trated fcf. Ch. 18). It is probably not an exaggeration to state that 
failure to consider such factors has invalidated the large majority 
of investigations which purport to have established a racial difference 
in one or another ability or personality characteristic. 

Racial comparisons are an extremely diflicult problem of differ- 
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ential psychology. In addition to the above-mentioned sources of error 
which they share with all group comparisons, studies on race dif- 
ferences are handicapped by special difficulties inherent in every 
phase of the problem. Thus it has proved a diflicult matter in such 
studies to decide whom to measure, what to measure, and how to 
measure it. These difficulties will be analyzed in the present and sub- 
sequent chapter. The first of these two chapters will be concerned with 
questions of whom to measure, or the selection and classification of 
subjects in racial studies. In the second chapter w'ill be discussed some 
of the major problems which arise in the efforts to measure and 
compaie W'idely diverse groups. A third chapter will be concerned 
W'itl) the relative contribution of racial and cultural factors to the 
development of existing group differences in behavior. The special 
experimental designs which have been employed for this purpose 
will be considered and illustrated with typical investigations. 

The data of investigations on race differences have been grouped 
about these methodological questions. No general summary of find- 
ings and no 'intellectual hierarchy” of racial lO's are presented be- 
cause, although apparently useful as mnemonic devices, such tabu- 
lations would be ol dubious value. Isolated facts arc particularly 
misleading in racial comparisons and should at all times be evaluated 
in terms of the conditions under wdiich tliey were collected. Conclu- 
sions on race difierences wall therefore be drawn only in the light of 
a critical analysis of tlie entire problem and wall not be divorced from 
their limiting conditions. 

No attempt has been macK, furthermore, to survey the vast array 
of investigations on psychological differences among racial groups. 
For summaries and more extensive discussions of t]]is problem, the 
reader is referred to such sources as Garth (25, 26), Klincberg (36), 
Mann (44), and others (>6). Special surveys of psychological inves- 
tigations dealing with the American Negro have been prepared by 
Klincberg e/ ul. (37) and Canady (8). For an orientation to the 
general problem of race, books by Boas (5, 6), Kroeher (41), and 
Dunn and Dobzhansky (20) will prove helpful- To obtain a well- 
rounded picture of specific groups, the reader may consult the inten- 
sive field studies conducted by anthropologists, sociologists, and 
psychologists on certain groups, such as C. du Bois’ study of the 
Aloresc in the South Pacific (19), or the investigation of Khiekhohn 
and Leighton (38, 39) on the Navajo Indians. Such studies report 
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psychological test results as well as a detailed analysis of the cultural 
background of the groups. The same approach is illustrated by the 
extensive scries of investigations on Negro youth conducted by the 
American Youth Commission of the American Council on Educa- 
tion (1, 16, 23, 34, 47, 54, 57). Although primarily sociological 
in methodology, the latter studies contain a wealth of psychological 
material. Data were gathered on several thousand Negro adolescents 
in many parts of the United States. Community studies, intensive 
interviews, case studies, and psychological testing were among the 
methods employed in dilfcrent parts of the survey. 

WHAT IS A RACT:? 

Tradition, prejudice, and the snap judgments of everyday observation 
have contributed to the development of a concept of race as a clearly 
differentiated and easily identifiable group, possessing distinctive phys- 
ical, mental, and temperamental characteristics. Fhe observations of 
biologists, anthropologists, and psychologists, however, fail to support 
such a view.- The classification into racial groups is essentially a 
biological one and corresponds to such divisions as breecl. stock, and 
strain in infrahuman organisms. In its simplest terms, any definition 
of race implies a certain cof/ini unity iff physical characteristics based 
primarily upon a common tiered it v. 

The task of race classification is far more complex than would 
appear from the glibness with which individuals are commonly as- 
signed to one group or another. The fivefold classification of races, 
formerly memorized by every school child, is of historical interest 
only. This system can be traced to Linmeus (43), the great classifier, 
who recognized four races of men — Europceiis albus (white), Ameri- 
camis rubescens (red), Asiaticus fuscus (yellow), and Ajricanus 
niger (black). A fifth group, the brown race, was subsequently added 
by Blumenbach (3), who also altered the terminology, proposing the 
now familiar classification into Caucasian, Mongolian, American, 
Ethiopian, and Malayan. This classification is crude and superficial, 
as will shortly become apparent. 

The essential problem in the classification of racial groups consists 
in the identification of inheritable physical characteristics which differ 

^ For n very readable account of many of the dilTiculUcs of race classification, see 
Huxley and Haddon (33). 



Racial Comparisons: Problems of Groupin^ 


693 


clearly from one group to another and which may thus serve as 
criteria of race. A wide variety of such criteria have been proposed 
and applied (cf., c.g., 13, 36, 41). Skin color, although popularly 
employed as one of the most obvious means of racial identification, 
has proved to be one of the least satisfactory of the possible criteria. 
It is a well-established fact that the same pigments are present in 
all human skins and that dilTercnt skin colors result from varying 
relative amounts of each pigment. For this reason, there is found a 
complete series of transition shades, making exact classification very 
difficult. Such a classification is also rendered somewhat unstable by 
the fact that environmental conditions, such as exposure to the sun’s 
fciys, have a marked ellcct upon skin color. 

Pigmentation of the eyes has proved to be a somewhat more 
promising index, in so far as it is unquestionably a hereditary trait. 
In the same connection may be mentioned hair col(n\ These traits, 
however, arc also difficult to describe quantitatively because of con- 
tinuous gradations. A further difficulty in the use of such criteria 
is their relatively narrow distribution, black hair and eyes being the 
universal rule outside of the Caucasian stock. 

In addition to coloring, other characteristics of the hair have been 
employed as dilTerentiating signs. The teMurc of the hair is generally 
regarded as a valuable aid in racial classification. For example, the 
straight, stilT hair of the American Indian is in sharp contrast to the 
woolly, tufl-likc hair of the Hottentot. Fullness of the beard and 
hit safeness, or amount and distribution of hair on the body as a 
whole, have also been employed in such classifications. 

Racial groups have been dilTerentiatcd on the basis of i^ross bodily 
dimensions, chief among which is stature. Group differences in this 
respect are, however, surprisingly small and consequentiy of doubtful 
value in racial identification. Facial and cranial nicasiiremenis have 
been employed to somewhat better advantage. Among the former, 
the most common are nasal index, which expresses the relative length 
and breadth of nose, and various indices of prognathism, or the 
degree of protrusion of the jaws. Ciaiiial capacity, or volume of the 
skull, yields rather ambiguous results because of its dependence on 
general body si/e and because of the wide variation within groups 
with consequent ovei lapping between groups. Cephalic index,*' on 

^Cephalic hKlcx - — | - head vMcUh dcscripiion, see Ch. 12. 

headJongih 
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the other hand, has proved to be one of the most satisfactory criteria 
of classification and is now widely employed. 

In view of the relative paucity of satisfactory anatomical criteria, 
attempts have been made to evolve physiological or biochemical 
schemas of classification. It has been suggested, for example, that 
races might be classified on the basis of blood groups, which have 
become familiar in connection with blood transfusions {ct. Ch. 16). 
These blood groupings refer to the agglutinative reactions of the red 
blood corpuscles, i.e., the tendency of such corpuscles to clump to- 
gether when the blood of certain individuals is mixed with that of 
certain other individuals. In a few^ early studies, the relative incidence 
ot A. B, AB, and O blood types in dilTcrcnl racial stocks was used as a 
basis for racial classification, but the resulting groupings conflicted 
sharply with other criteria of race f3L 41 ). More promising results 
have been obtained by including the recently identified M, N, and Rh 
factors, as well as the various subgroups found for some of the factors. 
The racial classification suggested on this basis seems to agree fairly 
closely with the data on geographical distribution and common descent. 

The endocrine glatids have also played their part in race classifica- 
tion. Likenesses have been noted, for instance, betwccit the physical 
and alleged psychological characteristics of certain racial groups on 
the one hand, and the characteristics associated wiih certain patho- 
logical glandular dysfunctions on the other (cf., e.g., 35). 'Lhiis a 
parallel has been drawn between the cretin and the African Pygmy. 
Pituitary enlargement has been attributed to the Hottentots and 
adrenal deficiency to the Negro, The '^childlike'” .ippcarance of the 
Chinese has been ascribed to an overactive thymus. Such methods of 
classification arc especially questionable for two reasons: they take a 
superficial and partial resemblance as their point of departure; and 
they reason from pathological coiiditiosis existing within a single 
group to the normal characteristics of entire groups. 

Fina.lly, mention should he made of the dlorts to deal with race 
in terms of c(mstitutional type (cf. Ch. 13). Kretschmer (40), for 
example, believed the ratio of Icplosomcs and pyknics to difler in 
various racial groups and olTcrod Ihis as a possible explanation for 
the psychological differences between such groups. Others, both prior 
and subsequent to Kretschmer, have attempted similar classifications. 
The reader should recall in this connection the difficulty of finding 
“pure types"’ and the absence of valid evidence for a correlation 
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between the physical characteristics of such types and any of their 
alleged psychological differentia. 

i: VALUATION OF THE C RITERIA OF RAC C 

In addition to the special dericiencies of individual nicthods of clasd- 
lication discussed in the preceding section, certain nuijor dinicLij'.ies 
are encountered in the application of all, or nearly all, criteria of 
race. In the first place, a wide xariahility exists within any one group 
in respect to any trait. Closely related to this is the marked o\cr- 
lappui:^ between different groups in any of the criteria mentioned„ 
HiLis, although two groups may differ signilicantly in average height, 
individuals can readily be found in the “shorter group” who are taller 
than certain individuals in the “taller group ” d'his obviously makes 
group delineations indistinct and relatively arbitrary. 

A third difficulty is the incousisrency frequently found vvlien more 
than one criterion is employed. An individual might have the coloring 
of a Nordic, the cephalic index of an Alpine, and the stature of a 
Mediterranean. Or very dark skin pigmentation and woolly hair 
might be found in association whth Oaucasiaii features. Such instances 
are frequent and cannot be dismissed as exceptions. 

Finally, it should be noted that many of the alleged racial char- 
acteristics which were formerly believed to be stable and innate are 
being found to be susceptible to cmirotimcntal influences. Even such 
apparently “hereditary” trails as body height, skull shape, and facial 
conformation, have proveu to be dependent in part upon environ- 
mental conditions in early childhooil. 'fhis W'as illustrated in certain 
investigations by Boas (4, 7) on t! e American-born children of im- 
migrants from several Furopean countries. These children w'cre com- 
pared with foreign-born children from the same countries, who vverc 
also living in America. Differences were found in stature, weight, 
and length and width of head. 

The most striking demonstralimi of environmental influence, how- 
ever, was furnished by an examinaie -r, of the cephalic indices of two 
immigrant groups which diller markedly in head shape (4, 7). Ameri- 
can-born and foreign-born boys were compared within an East Euro- 
pean Jew'ish group and a Sicilian group, both Jiving in New York City. 
The Fmmer are characteristically round-headed, having a high cephalic 
index; the latter are characteristically long-headed. As will be seen 
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in Table 43, residence in the new environment tends to make the 
Jewish group more long-headed and the Sicilian more round-headed, 
both groups convciging toward the American norm It will also be 
noted that those bo>s born alter a relatively long period of American 
residence of the mother tend to show a greater change tlian those 
born after a shortci residence period Ihis was also lound to be the 
case in the data on other immigrant groups. 

TABLE 43 Cfianqe in Cephalic Index of Two Immix^rcint Croups 
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strated A compaiison of loieign bom persons who had mimigialcd 
at diifcr^iU periods showed no samihc^ant dilTcr<.nces in the traits 
imde-r consideration 1 he measurement of Amei ican-born ard hnen^n- 
born chiUlrLii of tne same patents, luithei inoie, levcalcd uihcrsiKcs 
m ttiC e'pLcad (hrection 

llic rt Lilts ol Bous hive sub'^equentV Ixcn cnrroboritcd by 
Guthe (2*^)) who compaied the cephalic indiL^^s ol 1 7 Russian- 
born Jewish children and 127 yVmerican-born JlwisIi childien in Bos- 
ton 1 he ecpli ihc indices lound by Hirsch (30) on American born 
childien of Sv'nth Italian, Russian-lew ish, and Swedish parentage 
weie also in genual agreement with the coriesponding figures re- 
ported by Bo. s Similar changes in the shape of the lie id were found 
by Dornteldt (IS) through exUnsivc measurements of migrating 
Jewish groups in JLiirope. 
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Investigations along such lines have also been conducted on Japa- 
nese groups migrating to Hawaii (50) and to the United States (53). 
Spier (53) obtained a series of anthropological measures on 320 
American-born Japanese school children in Seattle, Washington, and 
its vicinity. The same measurements were repeated on 521 school 
children living in those sections of Japan from which most of the 
Seattle group was believed to have come. In general, the Ameriean- 
born children were larger, taller, more round-headed, and had wider 
faces than those born in Japan. Many of the individual comparisons 
ol corresponding age and sex groups yielded statistically significant 
differences between the American-born and native subjects. As in 
the case of the buropcan iinmigrants, the dillcrences tended to 
become more marked the longer the mother had lived in this 
country. 

A variety of factors have been proposed to account for the changes 
in physical type found in immigrant groups. Differences in bedding 
and cradling, as well as the gradual abandonment of swathing cus- 
toms practiced in the mother country, have been cited as possible 
explanations of the changes in head shape. Nutrition and type of diet 
are doubtlessly important factors in all the physical changes noted. 
Alteration in the activities of the endocrine glands under the stress 
of adjusting to a new culture has also been sug^gested as a possible 
factor (cf. 30). Mo.st of these explanations are, to be sure, highly 
speculative. Whatever the specific influence or influences at work, 
however, it is quite clear that they are of an environmental nature. 


A TENTAIIVE ( 1 ASSII 1C AT ION OF RACIAL CROUPS 

It is apparent that no one criterion (^f lacc can yield a satisfactory 
classification. Nor can clear-cut group distinctions be made with a 
combination of such criteria. It should be borne in mind that at best 
any racial classification is approximate. No sharp line of demarca- 
tion can be established between groups, nor can every individual 
be unequivocally assigned to one p.aiicular group. The classification 
which has been most widely used by anthropologists and psycholo- 
gists is one based upon a combination of criteria, chief among which 
are cephalic index, hair quality, hairiness on the body, facial con- 
formation, and bodily proportions. An outline of this classification 
(cf. 41, p. 132) is shown below; 
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1 Caucasian 
Nordic 
Alpine 

Mediterranean 

Hindu 

II Mongoloid 
Mongolian 
Malaysian 
American Indian 

m Negroid 
Negro 
Melanesian 
l»>gmy Black 
Bushman 

IV Of doubtful classification 
Au‘'traloid 
Polv nesian 
Ainu 

Veddoid ( Indo- Australian ) 

Within the C'aiicabian or while nice, four subdivisions arc gen- 
erally recognized. Three of these groups are the Nordic, Alpine, and 
Mediterranean classes into vshich the population of BurcTpc is divided; 
the fourth consists of the Hindus. The Nordics are described as tall, 
blond, blue-eyed, fair-skinned, and dolichocephalic, or long-headed. 
They occupy a horizontal belt around the Baltic and North Seas, 
covering most of England, northern France, the Scandinavian penin- 
sula, Holland, and northern Germany, dhc Alpines, located chiclly 
in central Europe, are of medium stature and iiUermedialc coloring, 
but dclinitely brachycephalic, or broad-headed. In tlie Mediterranean 
group, we again find a pronounced clolichocephaly, accompanied by 
black or brown hair and eyes, relatively dark skin, and short stature. 
As its name implies, this group is found principally on the shores of 
the Mediterranean, comprising most of the population of Spain and 
Portugal, suuthern France, southern Italy, Greeec, and certain parts 
of northern Africa. The Hindu, although darker skinned, bears a very 
close resemblance to the Mediterranean and is sometimes included 
within this group. 

The Mongoloid race is characterized by straight hair, very little 
hair on the face and body, thin lips, and frequently the cpicanthic fold 
which produces the appearance of “oblique” eyes. Short limbs are 
usually the rule in this group. Skin color may be yellow, brown, or 
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reddish. This race comprises the Oriental Mongolian, as well as the 
American Indian and the Malaysian. All three are believed to have 
evolved by differentiation of the same primary stock. Close and exten- 
sive observation shows the physical differences between these groups 
to be much less significant than is popularly supposed. 

The Negroid race has relatively long arms and legs, woolly hair, 
relatively little hair on the face and body, full lips, and a flat nose. Skin 
color is black or dark brown. This stock has been subdivided into 
the African Negro proper, the Oceanic Melanesian, the Pigmy Black, 
and the Bushman. 

1 here still remain certain groups of people of doubtful classifica- 
tion . These cannot be assigned definitely to any one of the three 
major human stocks. These peoples exhibit the characteristics of more 
than one group and would thus be classified inconsistently with re- 
gard to diflerent sets of racial criteria. They include such groups as 
the Australian and Indo-Australian (Veddoid), the Polynesian, and 
the Ainu, a people of low cultural status inhabiting an island off 
the coast of Japan. 1 he Ainu have both Caucasian and Mongoloid 
traits, but arc characterized by a thick hair-covering on the entire 
body. The impossibility ol classifying these groups is not a serious 
deficiency of the present schema, however, since they comprise only 
a very small segment of the human species. It has been estimated that 
about 99% of all mankind can be assigned to one or another of the 
three major laccs. 

This classification is a decided improvement over the traditional 
“five races,” but some oi its subdivisions probably still represent 
ovcrsimplilications. This is particularly true of the tripartite division 
of luiropean peoples into Nordic, Alpine, and Mediterranean. In an 
extensive analysis of available data from a variety of sources, Coon 
(13) concluded that the “races of Europe” fall not into three but 
into ten principal categories, several of which can be even further 
subdivided. His principal categories inchide* Brlinn, Borreby, Alpine, 
Ladogan (two subdivisions), Lappish. Mediterranean (three sub- 
divisions), Nordic (four subdivi ous), Dinaric, Armcnoid, and 
Noric. Coon has called attention to the fact that the simpler classi- 
fication into Nordic, Alpine, and Mediterranean, first proposed by 
Ripley in 1899, mav have paved the way for facile “typing” of 
individuals and glib use of racial catchwords. It is certainly easy to 
slip into the stereotype of “tall, blond Nordics,” “short, dark Medi- 
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terraneans,” and Alpines who are “intermediate” both geographically 
and physically. An awareness of the probable complexity of the total 
picture may help to check such oversimplified stereotyping. 


NATIONAL AND LINGUISTIC GROUPINGS 

Racial affiliation should not be confused with nationality. A race 
is a biological group; it implies a cert lin community of hereditary 
background and is identilied by physical criteria. A nation, on the 
other hand, is a cultural, political, and geographic grouping. It has 
been a common practice, especially in the popular literature on the 
subject, to regard all the individuals of a given nation as members 
of a single race. This is far from the truth and can yield only mis- 
leading results. 

In France, for example, can be found Nordics, Alpines, and 
Mediterraneans (as well as other groups included in the finer classi- 
fication discussed above), dillcrcnt strains predominating in different 
parts of the country. In modern Germany, true Nordics are relatively 
scarce. Certain Nordic sub-groups predominate in small areas of 
Germany, but other regions arc populated principally by Borreby, 
Ladogan, Alpine, Noric, and Dinaric strains. Although Mediterran- 
eans predominate in southern and central Italy, northern Italy is 
largely Alpine and Dinaric. Mediterraneans also predominate in cer- 
tain sections of Ireland, England, Scotland, and Wales, other regions 
containing principally Nordic, Brlinn, and a few other scattered 
strains. It is thus apparent that any racial classification must be made 
on an individual rather than on a national basis. 

Another common source of confusion is that between racial and 
linguistic or philological categories. Thus such terms as “Latin,” 
“Aryan,” and “Semitic” are frequently employed in popular discussion 
to signify races. But the groups which now speak languages of Latin 
origin — including French, Italian, Spanish, Portuguese, and Rou- 
manian, among others — present an extremely varied racial composi- 
tion and arc not a unit in any but the philological sense. The term 
“Aryan” is likewise a very broad one applied by students of linguistics 
to all those peoples using a derivative of the original Indo-European 
language. Similarly, the term “Semitic” refers to a group of languages 
and not to any biologically distinct group of people. 
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In this connection, mention may also be made of the Jews, a group 
characterized by religious and other cultural uniformities, but racially 
very heterogeneous. Believed to be originally Mediterranean, this 
group now contains Nordic and Alpine elements, as well as mixtures 
of several other European strains (13). Alpine characteristics, such 
as brachycephaly, arc more common than Mediterranean character- 
istics in the group today, but the most conspicuous fact is undoubtedly 
the wide diversity of physical types represented. 

The loose use of national, linguistic, and even religious nomencla- 
ture interchangeably with racial designations has further complicated 
an already dillicull problem of classification. It is well to bear in mind 
the distinction between these various types of categories. 

RACE MIXTURE 

An additional dilliculty in the way of racial classification is intro- 
duced by the extensive amount of race mixture which has been going 
on for countless generations. Such mixture is particularly common 
among the sub-groups of the white race. Consequently, it is difficult 
to find many “pure” Nordics, Alpines, or Mediterraneans even in 
those regions which are supposed to be characteristieally populated 
by these groups. Similar interbreeding has occurred to a greater or 
lesser extent among nearly all racial groups. There exist at present 
only a very small number of isolated primitive groups which may be 
regarded as racially ''pure.” 

When tlie racial mixiurc has occurred in violation of social dictum 
or group mores, as in the case of wffiitcs and Negroes in the United 
States, the problom of racial idt atificalion is further confused by 
the arbitrary classification imposed by society. Thus a "Negro” in 
many parts of the United States means an individual with any dis- 
coverable traces of Negro ancestry. Biologically such an individual 
may be much more closely affiliated with tHe Caucasian than with the 
Negroid race, but culturally he is usually a member of the Negio 
group and shares the social heritag. of the latter. 

Race mixture, or nusccgcnation, is a problem which has aroused 
much discussion in its own right. Its advantages and disadvantages 
have been argued at great length; enthusiastic exponents can be 
found for both sides of the question. Among those who consider 
miscegenation biologically injurious may be cited Davenport (15) 
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who argued that race mixture produces physical as well as psycho- 
logical “disharmony,” the mixed group being a “badly put-together 
people.” Negroes, for example, have relatively long limbs, whites 
relatively short limbs. Interbreeding between these two groups might, 
according to Davenport, result in individuals with long legs and short 
arms, or vice versa. Similarly, the mixture between a race with large 
teeth and large jaw's and one with small teeth and small jaws might 
produce individuals with disproportionate combinations of jaws and 
teeth. This was, in fact, ollcrcd by Da import as a possible explana- 
tion for the frequency of tooth decay in the United States, since 
Americans represent a mixture of so many different strains! 

The hillacy in this argument lies in its assumption that specific 
organs are inherited as unit characters. The relation between an 
individuaTs bodily or psychologieal traits and his gene constitution 
is, of course, much more complicated than that. In the process of 
growth, moreover, all parts of the organism interact and inllucncc each 
other's development, thus producing a balanced and harmoiiious 
relationship of parts.* Observations on hybrids, both in the human 
and in infrahuman species, reveal no significant di'-harmonics. The 
success of many animal breeding experiments certainly testifies to the 
beneficial results wdiich may be obtained with race crossing. Physical 
measures of hybrid races ha\e likewise shown either an increased 
physical vigor in the hybrkl generation or a piiysical status which is 
midw'ay between those of the parent groups. Jn no case has a consistent 
physical injeiiority of a hybrid group been rcliahh established. 

'file effects of race mixture have also been discussed from the 
standpoint of the historical achie\ enients of various groups (cf. 4S). 
I vvo opj)osed theories have been proposed regarding the influence 
of race mixture upon the rise and fall of civilizations. On the one 
hand are cited ancient Hgypt, classical Greece, and the Roman Em- 
pire, whose decline coincided with a widespread intermixture with 
culturally underprivileged immigrant or servile groups. Similarly, the 
relative backwardness of certain prcscnl-day groups, such as arc 
found in jMexico and South America, has been attributed to the fact 
that they arc of hybrid stock. 

An equally strong case can he presented, however, in support of 
the opposite theorv. Racial purity is often associated with a very 
low level of cultural development. Thus among the most racially 

‘ Foi a fuller discussion ol these cnlicjsnis, cf. lU, 11. 
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pure human groups may be mentioned the hill folk of India, the 
Andaman Islanders, and certain }?.skimo groups. In our own country, 
the closest approximation to purity of racial stock is probably to be 
found in certain isolated mountain communities, wliich are notori- 
ously backward in social and intellectual development. Conversely, 
the acliievcmenls of western civilization can be sliow'ii to be the cul- 
Uiral expression of hvhrid stocks. All the great European nations 
present a coiiiplcxity of racial composition. The history of the United 
Slates furnishes a particularly sti iking example of the achievements 
of a highly mixed grcaip. It can also be shown that many great men 
v/cie the prcKluct of much interbreeding of diver ve stocks. 

The apparent inccmslstencies in such data arise frotn the attempt 
to estabhsii a causal rclaiionsbip between race mixture and ciillural 
level. There is, in fact, no reason to expect a direct relation between 
the two. Both may in turn be dependent upon a third iacior, the 
degree of social contact or social isolation of a group. Cnhural dc- 
vek^pment is usually promoted by contacts between groups, with the 
resulting interchange of diverse material and inlclieetual products. 
At the same time, such contacts are conducive to race mixture. Hence 
a heightened cultural development is often found in association with 
race mixture. 

In certain situations, social factors may cause the rewerse relation- 
ship to hold between degree t^f racial purity and cultuial develop- 
ment. Thus in a period of cultural degeneration, miscegenation w'ith 
a despised group may be tolerated as social barriers arc lowered. In 
such a case, as in anciem cmpiies in their decadent periods, the race 
mixture is but another symptom of a disruption of traditional be- 
havior and may temporarily come.vie with a period of low imcllcctnal 
achievement and cultural detcrioi alien. In either cas.\ the associa- 
tion betww‘cn race mixture and cultural level is an indirect one, and 
cannot be eitetl as evidence for a biological basis of cultural de- 
velopment, 

HVlMtcTRANT GKOUPS 

Many alleged “racial” ccanparisons have been made on immigrant 
groups in the United States, the subjects being classified according to 
country of birth. If American-born children of immigrants are em- 
ployed, they are usually classified on the basis of parents' birthplace. 
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Such investigations cannot yield any information on the problem of 
race differences. As has already been pointed out, national groups 
cannot be assigned as a whole to one or another racial stock. But 
even for the study of national differences such data are inadequate 
and misleading. Immigrants cannot be assumed to be representative 
samplings of their nome population. They are not drawn propor- 
tionately from all educational, economic, and social levels, but usually 
represent a select group. 

A more serious dithculty is that such selective factors may operate 
difierenlly in each country. As a result, immigrant groups from dif- 
ferent nations are neither fair samplings of their home populations nor 
comparable amom: themselves. If it could be shown, for instance, that 
immigrants irom all nations were drawn consistently from the lower 
socio-economic le\els, then such groups would at least be com- 
parable with each other. But it is well known that, through purely his- 
torical reasons, the immigrants from some nations may represent a 
relatively inferior sampling of their population, from others a more 
nearly random or average sampling, and from still others a relatively 
superior sampling. Moreover, the nature of the sampling from a given 
country may change markedly iiom lime to time. 

It has been Irequcntly suggested, for cvatnple, that the superior 
performance oi Cliincse and Japanese children in America on many of 
our intelligence tests may be the result of selective lactors, only the 
more progressne families emigrating Irom ihesc countries (cf., e.g., 
14. 49). Many of the immigrants from southern Europe, on the other 
hand, are probably an interior sampling of their own national popu- 
lation. In one investigation (22), groups of Danish and Italian girls 
in the Ihiiled Slates and in Europe were examined wath the Inter- 
national Group Mental lest. Although the Danish samplings in this 
country excelled the Italian, no significant difference was found bc- 
tw^cen the gioups tested in Copenhagen and in Rome. 

It is apparent that the testing of immigrants can throw little or no 
light upon the relative ‘tatus of the national groups from which they 
are drawm. it might be argued, however, that the determination of 
the abilities and personality traits of the immigrants themselves is of 
direct practical value for restriction of admittance, assimilation, and 
similar purposes. Such an argument fails to take into account two im- 
portant aspects ol the problem. In the first place, the behavior of immi- 
grants may simply reflect their former envmmmental background. Wc 
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cannot assume that the emotional and intellectual traits of such per- 
sons are innately determined just because they persist under the new 
environment. The influence of early conditioning is too strong to be 
readily wiped out. Similar traits would also be noticeable to a slighter 
extent in the offspring of immigrant parents, as long as family tradi- 
tions and the practices of the home country endure. 

A further point to note in the study of immigrant groups is that the 
immigration itself, with its resulting necessity of adjusting to a new 
culture, is an important environmental inlluence (cf., c.g., 52, 58). 
This factor cannot be ignored in analyzing the intellectual and emo- 
tional make-up of the immigrant. The confusion of standards and the 
shifting reference points contingent upon such an adjustment cannot 
fail to have an etfect upon the subject’s behavioral developmeiit. The 
point has frequenlly been made that the maladjustment is iireatest, not 
in the case of the immigrating generation who retain their customs to 
a large extent, nor in the case of the third and succeeding generations 
where adaptation and assimilation is virtually complete, but in the 
case of the offspring of the immigrants — or second generation — who 
are caught in the maelstrom occasioned by two different frames of 
reference. 

For example, a survey with, the Woodworth-Mathews Test of Emo- 
tional Instability revealed a much higher average number of neurotic 
symptoms among the children of immigrants than among those of 
native parentage (45 ) . The children tested ranged in age from 9 to 19 
and in school grade from the fourth to the twelfth. Iloth sexes were 
included in tlie sTiulv. d he median number of neurotic symptoms re- 
ported by each of the three major groups selected for comparison was 
as follov/s: 
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Data such as tl-esc do not constitute 'it adequate basis for the conclu- 
sion that Jewish or Italian groups in this country are by nature emo- 
tionally unbalanced. In a similar situation, the ‘‘normal" individual 
upon whom the test was standardized might have reacted similarly. 
The fact that immigrant groups often live under poorer socio- 
economic conditions than the native population may likewise affect 
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Ihcir intellectual, emotional, and social adjustment. Such effects will 
be considered in the appropriate section of the following chapter. 

DII FI.RI NTIAL SOCIAL SLLLCTION 

A type of selective sampling which complicates certain comparisons 
among racial or national groups results from difTcrential social selec- 
tion. We have already seen examples of such diiTercntial selection as 
it operates with respect to men and women (Ch. IS). In a similar 
manner, it may operate with respect to various minority groups living 
within the same country. In evaluating any results on special groups, 
such as college students, army inductees, or institutional populations, 
we need to be on the alert for possible spurious effects resulting from 
such selective factors. 

For example, comparisons of the test performance of Negro and 
white soldiers in World War 11 are complicated by the fact that Selec- 
tive Service screening standards were apparently different for the two 
groups (32). Similarly, in World War I, the policy with resjX'Ct to 
the administration of the Alpha and Beta examinations varied some- 
what from camp to camp because of practical exigeniMcs (cf. 24). 
d'hiis in some localities men vvJio scored behnv 30 on Alpha might be 
rc-examined with Bela. In others, because of more demand for Beta- 
testing, all those who oblaipcd an Alpha score barely higlicr than zero 
were classified on the basis of Alpha alone. This would obviously 
afiect the comparisons of Alpha or Beta averages among groups tested 
in different localities, Since the proporiion of foreign-born or of 
Negroes also differed irorn camp to camp, comparisons between these 
groups and nativc-boin white draftees would be correspondingly 
affected. As a matter of fact, such a practice wvuild tend to c\aggcyatc 
the diricrcnccs among the groups being compared. 4"hose groups in 
which the need for Beta w^as more prevalent (because of language 
handicap, illiteracy, and the like) would be the vciy ones in which 
tlie use of Beta had to be restricted more stringently.'* Hence in such 
groups only persons receiving Alpha scores close to zero would have 
the benefit of a retest with Bela. 

Another illustration of differential selective factors is provided by 
comparisons of Negro and white college students. Jn one typical inves- 
tigation (21 ), Negro college girls were found to be significantly more 

•''This was not, of coui.se, true in all camps, hut only in those in wh eh testing 
facilities wcic relatively inaclecjiiate 
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“self-sufficient” as indicated by the Bernreuter Personality Inventory, 
the remaining scores on this lest yielding no significant differences be- 
tween the two groups.’* Docs such a finding demonstrate that Negroes 
arc more scJf-sulhcicnt than whites? Obviously not, since only college 
girls were tested. Docs it indicate that in the upper intellectual levels 
Negro girls are more self-sufficient than white girls? Not necessarily, 
because Negro girls who go to college may represent a selected sam- 
pling not only with respect to intellectual level, but also with respect 
to a number of personality traits. It may require more self-sufficiency 
for a Negro girl to continue her education than it docs for a white girl, 
because of the relatively greater economic and social obstacles which 
the former must overcome. Any personality dinerence between the 
two groups may thus do no more than reflect these differences in the 
operation of selective factors. Of course, we must also consider the 
possibility that g(ung to college may itself be a factor in increasing the 
self-sufficiency of a Negro girl. The realization that one has success- 
fully surmounted obstacles is probably an important condition in the 
development of feelings (;f self-sulliciency. 

Statistics on crime and insanity arc especially subject to differential 
selective facUn-s. Siatcments have often been made regarding the 
“predisposition” of various racial groups to crime, 'file large percent- 
age of crime in the United Stales has been attributed by some to the 
inlUix of certain classes of immigrants into our ccnintry. Statistics have 
been cited to show the greater frequency c*f crime among Negroes and 
aiming immigrants from e ’stern and soutlvem Europe than among the 
nalive-boni white pO;pulation. 

Figures often lie, and in the interpretation of crime statistics it is 
particularly difficult to disentangle the many imconlroiled factors 
which confuse the issue. Among such factors may be mentioned the 
inequality in arresl^ and eonvictiems among various groups; Negroes 
and “foreigners,” for example, are more readily arrested “on suspi- 
cion” and on less grounds them is gencrall\ required for native-born 
whites. The fact tliat most foreigners arc adults would also give Ihem 
a disproportionate percentage of crime if they are compared with the 
figures for native-born persons of all ages. Similarly, forcigneis are 
more often city-dwcllcrs and live under poorer social and economic 
conditions than native-born Americans — all of which is conducive to 

’’Cr. p 672 in Chapter tor a list of the other aspects of personality covered 
by this test. 
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crime. The foreigner, furthermore, may have brought with him tradi- 
tions and folkways which happen to conflict with the accepted be- 
havior in our country. Mexicans in the United States, for example, 
show a relatively large number of arrests for carrying concealed 
weapons (60). In this they may simply be continuing habits which 
they acquired in their own country in a perfectly legitimate way. 
Despite the many factors which load the dice against the foreign-born 
in crime statistics, careful analyses of the data on native and foreign- 
born persons over 18 years of age have failed to reveal a higher rate 
of arrests, convictions, or commitments among the latter (cf. 60 ) . 

The American-born children of immigrant parents present a some- 
what dilTerent problem. On the whole, the crime rate among such 
“second-generation” Americans is higher than among offspring of 
native parents (28, 52). The conflict between the old and new culture 
is undoubtedly an important factor in the emotional and social malad- 
justment of such individuals (27, 51, 52). At the same lime, it should 
be noted that foreign parentage need not in itself be associated with a 
higher crime rale. Surveys liave shown that in many states the sons of 
immigrants have a lower commitment rate than the s^)ns of native 
parents (9, 55). Those states having a higher crime rate among per- 
sons of foreign parentage arc generally the more highly industrialized 
and urbanized slates. They also contain a larger proportion of immi- 
grants from those European cultures which aie most unlike our own. 
llius the socio-economic conditions under which the immigrant groups 
li\'c, as well as the degree ol conflict between the old and now cultures, 
seem to be major factors in determining the crime rate. 

Most of the conditions which render the evaluation of crime statis- 
tics diHicuIt also affect the data on insanity. In addilicjn may be men- 
tioned the factor of hospitalization. Institutional subjects may not be 
a representative sampling of the actual cases of mental disorder in 
dilTerent groups, since the available facilities for hospital care arc not 
equal in diflerent parts of the country. On the other hand, because of 
economic conditions, certain groups are better able to care for the 
mentally disordered persons at home, thus eliminating the necessity 
for hospitalization. It is interesting to iKfle that, although the uncor- 
rccted hospital statistics show about twice as many cases of insanity 
among the foreign-born as among the native-born, the difference vir- 
tually disappears when various corrections are made for sampling 
inequalities (42). 
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Tn 7 hi: PRrrrDTNG chmmir it v/ds shov/n that the classification of 
ijidividiials inlo distinct races, as well as the cluiicc of gioups suitable 
tor racial comparisons, presents nnmy ditliciiltles. Even when a satis- 
factory sclccli(jn of ‘-nhjccts has been made, however, additional prob- 
lems remain to be solved. It is not suthcient to determine whom to 
measure. The questions wlial to measure and how to measure it are 
equally important. Thus it is necessary to decide which are the most 
significant traits tor comparison and what materials and techniques 
aie applicable to die testing of culturally dissimilar groups. The inter- 
pretation of the obtained dinerencvs also niiscs important questions. 
Is it possible to establnh a universal criterion oi “intellect’' so that 
we may speak of one group being intellectually “superior” and 
another “inferior”? What shall we use as norms or standaids for the 
evaluation of widely diverse peoples? The latter is a very fundamental 
issue in differential psycho! >gy 

Individuals who diller in racial afTiliation also diHer in many other 
respects. It is theretorc ^ery dilTicult to isolate the lactor of race so as 
to determine its diiccl inllucncj up(Mi the subject's behavioral develop- 
ment. Members dillcrcnt lacial gioups Irequently speak different 
languages, a fact which greatly restricts the range of traits in which 
inter-gioup comparisons can validly be made. The dilTercnces in gen- 
eral educational opportunities and specihc type o1 training available 
to each group have an undoubted influence upon psychological test 
performance. Such groups may likcw .>e differ in their general social 
and economic level and in the lacililies for intellectual advancement 

^ For the sake of hieM‘l>, the lerni “race” will be employed Without quotaiion 
marks or othci qiinlrlicacotn. to refer to groi.r'»s so dcsignaled in the particular inveiti 
gallon under consicki alion. Il is not Ur be assumed, theien)ie, lh..t such groups 
constitute races iii the sense rn which this Uim was defined in the preceding chaptei 
la each case, the natuie of the groups will be appaicnt from the conteU. 
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ofTered in their own homes. The background of tradition and culture 
against which the individual develops is also fundamentally diverse 
from group to group. The emotional atlilude>, interests, ideals, and 
prclerences fostered by such surroundings will nol be the same, lo 
this may be added the man> dithculties arising when an examiner from 
one racial or national grtuip administers psxchcdogical tests to subjects 
in another group, lliis situation is not comparable to the testing (4 
sul)jects within one’s own group. 

A consideiable body of evidence is available which demonstrates 
the iniluence ol the above factors upon p'-^chol >gical test pcrlormante. 
I reqiiently such data weie gathered incidentallv in studies whose 
major purpose was tlie establishment ol ia.ee dilieruyees in ability. A 
few iii\e>tigalions, (m the other hand, have been coiidueted with the 
explicit aim ol anai)/ing the pitlalls in i<iciil compiirisons. In cither 
Cvise, the data seem cisar in their implication that factors cither than 
race aic operate e in cdleged racial dinerciices ll should be noted that 
the cjuestion (4 rav.e u/sus cultuie in d'c production ol gtiuip dilTer- 
ences is one phase ol the gen^i il piohkrn ol heicd)l\ and envnon- 
nient. Race, it will be lecalkd, is a biological coiuepi based upon 
hcrulitarv commiinitx. Culture, on ttie other harii,^teki> to the 
enMionmcntal condition^ and behavior shared by the nienibeis of a 
single gioup. 

Tin COMP\R\ilM AC rril \ I Mr N7S (M DlIIIKINi R \C 1 S 

The point is somclinies made that the \a‘t dilleiencvs in the achieve- 
ments of various races tesiif) to their dr similar innate psvchological 
equipment, I hus jt is argued that the diheiencLS in cuituial level 
among racial groups iiiight be a uMth rather than a eausc ol psycho- 
logical diffcreiKcs among such giinips. The cultural milieu in which 
the individual is reared, with its special opportunilies and limitations 
for intcliectual and emotional development, mi‘?ht itself be a leilcction 
of the capacities of each race. The individual of a given race might 
thus be handiCiippid by po^r facilities lor inlelleetual development 
ju‘'t because his predecessors larked the capacity to produce a more 
“favorable” en cironment. 

As evidence o\ the wide inter-racial variations in achievement are 
cited contributions to science and invention, accomplishments in the 
realm of literature and the other fine arts, complexity of social and 
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political organization, technological advances, and many other aspects 
of cultural status. Comparisons have also been made in terms of the 
“eminent” men produced by each race. Thus Galton (24, pp. 325- 
337) at one time proposed a 16-point scale for estimating the “com- 
parative worth of dilfercnl races” by comparing the relative merits of 
men in each group who had achieved eminence. On this basis he sug- 
gested, for example, that the Negro is two grades lower than the ling- 
lishman, and the modern Englishman two grades below the Athenian 
of Greece’s golden era. 

It should be noted that any argument based upon the relationship 
between the cultural level and the capacity of races is reversible. On 
the basis solely of the association between these two faciors, it is im- 
possible to determine which is cause and which is elTect. There is 
therefore no reason for concluding ipso fat to that racial dilTcrenccs 
in cultural achievement indicate or result from a racial dilference in 
capacity. There is considerable evidence, on the ('tlier hand, which 
suggests that the cultural diiTerences may be responsible for the group 
dilTercnces in ‘'capacity.” 

In the fust place, achievement and cultural level are frequently 
found to vary not with race but with environmental factors. 'Ihus a 
group whicli is characterized by a given aclhevement level may be 
racially very heterogeneous and may constitute a unit only in terms 
of a common experiential Ivickground.^' Secondly, the relative achieve- 
ments of a given group are intluenccd by a number of factors which 
cannot themselves be atliibuted to racial capacity without stretching 
the point unduly. 1 he chaiactcristics of the physical cinironment, the 
degree of contact wnlh other groups, llie discovery of new' routes of 
travel and communieation, and hisu’irical events within (ft/icr groups 
— and thus not within the control of the group under cemsideration — 
have played an important part in the cultural development of many 
societies. 

Thirdly, mention may be made of certain broad shifts in the rela- 
tive cultural status of dilTercnt racial groups fr(>m time to time, dliis 
is particularly v/ell illuslralcd by soir.c of the ancient African civiliza- 
tions, such as the kingdom of Benin, whose achievements in many 
fields far outstripped the European cultures of the same period. A 
number of “lost arts” of these civilizations represent, in fact, abilities 
which have never been attained in any other group. In several cases, 

“ DiJta bearing on this point will be found in the following cliapter. 
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the shifts in relative cultural level occurred in the absence of any 
known change in the nature of the stock, as might occur through race 
mixture. Concomitant historical developments can, however, be found 
which might account for the change in cultural level. Finally, the 
reader may consider in this connection the weight of the evidence from 
other sources, discussed throughout the present book, which indicates 
the extent to which behavioral development depends upon environ- 
mental factors. 

In connection with the comparative achievements of different races, 
mention should also be made of the theory that “primitive” man excels 
in sensory capacities, in contrast to “civilized” man’s ‘ superior intel- 
lectual equipment.” I his theory has been especially proposed as an 
explanation of the remarkable feats of primitive persons in such tasks 
as the recognition of biids or animals concealed among foliage, the 
interpretation of footprints, the use of olfactory cues in finding one’s 
way or in identifying animals, and the like. Considerable evidence has 
been accumulated, howexer, to show that such achievements are not 
attributable to superior sensory equipment. Rather do they result 
from the indixiduars having learned to respond to ver) slight cues and 
to discriminate small diflerences. The siiuation is rouehly similar to 
that undei lying the blind person*^ ’^kill in icsponding to audilorv and 
tactual cues. Tne need'- of life m a primitive environment are such iis 
to encourage the learning o( proper responses to slight sensory cues 
which may spell dinger, food, or other urgent matters, that ^ixh 
achieveinv-nls result from learning rather than from race difieiences 
in acuity is suggested by the laet that persons from “civili/ed” coun- 
tries have proved able to Ijcirn similar responses when put in situa- 
tions which demanded them, 

Ob)ectiv/e te^is ol sensory acuity have likewi'^e lent no support to 
the view that primiuve man's achievements result fiom scii'^t^ry superi- 
ority. As early as HJ04. at the St. Louis World’s 1 air, Woodworth 
(63) aivt Bruner (11) applied what few tests were then available to 
.such groups as Americ<m Indiana, Negritos from the Philippine 
Islands, Mala)<in biiipinos, Ainus fiom Japan, Africans, Eskimos, 
Patagonians, and others. White visitois to the exposition were simi- 
larly tested. On such controlled tests of sensory acuity, the primitive 
groups did no better than the while norms. Subsequent investigations 
on many different groups have coiroborated these findings. 

A similar explanalion in terms of learning rather than sensory dif- 
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ferences seems to hold for alleged racial differences in musical 
achievements. The aesthetic intricacies of Indian dances have led 
many observers to ascribe a superior musical sensitivity to that race. 
Even more familiar is the traditional musical talent of the American 
Negro, whose achievements in this respect have become an important 
element of American music. That cultural rather than racial factors 
account for these accomplishments is suggested by extensive surveys 
with the Seashore Measures of Musical Talents. In the discrimination 
of pitch, intensity, time, and rhythm, as well as in the other simple 
tests in this well-known scries, no significant superiority in favor of 
Indians or Negroes has been found (5, 26). 

It is thus apparent that the cultural achievements of difTcrent groups 
cannot in themselves serve as an index of the relative abilities of 
human races. In the effort to obtain a more direct measure of abili- 
ties, psychologists have administered a wide variety of tests to individ- 
uals of different races and cultures. A mass of data, ranging from 
indices of simple sensori-motor abilities to measures of complex 
intellectual and emotional characteristics, have thus been accumu- 
lated on various racial groups. The interpretation of these findings, 
especially in the case of the more complex functions, is beset with 
many difficulties or pitfalls. In the remaining sections of the present 
chapter, we shall consider some of these dilliculties. 

LANGUAGE HANDICAP 

It is obvious that in the comparison of groups speaking different lan- 
guages, verbal tests cannot be employed. Non-language and perform- 
ance tests have been devised for this purpose. It is not to be concluded, 
however, that the same traits are being measured by all these tests. As 
was shown in Chapter 14, many tests included under the heading of 
“intelligence scales” call into play widely different abilities. Thus 
when unfamiliarity with the language makes the application of verbal 
tests impossible in a given group, the range of processes which can be 
nieasLiied in that group is thereby narrowed. There is no substitute 
for verbal tests. It is a psychological impossibility to eliminate the ver- 
bal content of a test without altering the intellectual processes 
involved. 

The actual effect of language handicap upon test performance is 
likely to be most serious, however, when such a handicap is present. 
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in a mild degree. If the individual has a moderate understanding of 
English, it is usually deemed unnecessary to give him a non-verbal 
test. But such an individual may lack the facility in the use of English 
or the range of vocabulary required to compete fairly on a verbal test. 
This situation is often encountered in immigrants who have lived in 
America for many years, or in the children of immigrants. The latter 
are frequently bilingual, speaking their own language at home and 
English at school. 

The fact that children with such relatively mild language handicaps 
generally obtain lower intelligence test scores has been frequently 
demonstrated. When children in the same schoolroom are tested with 
a common verbal intelligence test, those from foreign-speaking homes 
generally make a poorer show^ing as a group. In an invtytigation con- 
ducted in New Jersey, for example, children of Italian patpntage were 
given the Otis Group Test (46). When the children were divided into 
four language groups — those who spoke cmly Italian at home, those 
w'ho spoke Italian and some English, those who spoke English and 
some Italian, and those who spoke exclusively English-— a consistent 
rise in average score was found with increase in amount of E.nglish 
spoken at home. 

In an analysis of data secured independently by dilTcrent investi- 
gators, Goodenough (30) found a correlation of —.75 between the 
average 10 of children in various immigrant groups and tiie tendency 
of such groups to retain their own language lor use in the home. An 
index of the latter was obtained by finding the ratio of the number 
of parents in each national group who had been in this country for a 
period of 20 year> or over and had not ad{)pted fmglish, to the total 
mimhcr of parents in that group who had adopted it. Tlic high nega- 
tive eoi relation between tliese two factors indicalcs a strong tendency 
for children in those immigrant groups in which English is not readily 
adopted to obtain lower scores on o,ur inic!ligenee tests. Goodenough 
points out that there are \wo possible explanations of such a tinding. 
On the err: hand, the lower intcliigeiice lest scores of some groups 
may result directly from their greater language handicap. On the odier 
hand, those national groups in which English is not commonly :idopted 
may be less intelligent and less progressive from the outset. Their 
failure to learn English would thus be the result of low'c»* intelligence 
and poorer adaptability. 

Neither interpretation can be selected solely on the basis of the cor- 
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relation between the two factors. Other data, however, suggest that 
the former is the more probable one. It is quite likely, for example, 
that because of the greater similarity of some languages to English it 
is easier for individuals from certain countries to learn English, quite 
apart from their intellectual level. Another factor of possible relevance 
is to be found in the reasons for which immigrants from various coun- 
tries come to America. 1 hose from some countries may come largely 
with the intention of settling permanently; those from other countries 
may traditionally retain a vague hope of returning to their home coun- 
try after ‘‘making their fortune.” Such unpermanence is likely to oe 
reflected in their halfhearted attempts to master the English language 
or to see that their children master it. A further question is whether 
the immigr^ come into a community of their own compatriots ^ as 
found in ngn neighborhoods of some of our larger cities, or 

whether they scattered in predominantly American communities, 
Jn the case of certain national groups, represented by relatively small 
numbers of immigrants and not concentrated in any one area, the 
individual has little choice but to learn English. 

The most crucial argument regarding the role of language handicap, 
However, is provided by the finding that the infciioriry of the immi- 
grant groups is greatly diminished and may disappear entirely when 
non-language rests are employed. This has been repeatedly and con- 
sistently demonstrated with many groups. Pintner (33) tested 165 
school children of Italian, Cierman, and Polish parents and 121 chil- 
dren of American-born parents, mostly of Jnsh descent. On the Na- 
tional Intelligence Test, a pretiominanlly verbal test, only 37% of the 
foreign-parentage children reached or exceeded the median score of 
the native-parentage children. On the Pintner Non-Language Scale, 
however, 50% of the foreign groip reachetl or exceeded the native 
median, i.e., the two groups had identical medians. A similar result 
was obtained in an investigation at the prcscliool level (16). Two 
groups of nursery school children, one bilingual and the other mon- 
oglot, were matched in age, sex, and paternal occupation. The second 
language was Italian in all cases, " two groups, each consisting of 
106 children, were given the 1937 Stanford-Binct and a performance 
test, the Atkins Object-Fitting lest. The bilinguals were significantly 
inferior to the monoglots on the vSianford-Binet, but significantly 
superior to the monoglots on the object-fitting test. 

In a carefully controlled and extensive study, Aiscnian (2) admin- 
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istered the Pintner Non-Language Scale to 1152 American-born chil- 
dren of Italian parents and 1196 American-born children of Jewish 
parents. The children ranged in age from 9 to 14. Degree of bilin- 
gualism was ascertained by means of a written questionnaire. The 
results showed no significant correlation between extent of bilingual- 
ism and intelligence lest score in either group. The correlations were 
—.079 and - -.193 in the Italian and Jewish groups, respectively. 

It is interesting to note that, when the language handicap is fairly 
pronounced, even the use of English in giving directions in a non- 
verbal test may affect the scores. In one study, 236 Spanish-speaking 
children in the first three grades of Arizona public schools were given 
the Otis Primary Group Intelligence lest (4(S). This is a non-vcihal 
test with oral instructions, fo check the effect of bilingualism, one 
hall ot the group was given Form A with Spanish instructions, fol- 
lowed about ten days later by Form B with English instructions. The 
procedure was reversed in the other half of the group. The mean IQ 
was found to be 96.15 on the Spanish form and 86.87 on the English 
form. No child iccci\ed a higher IQ on the English than on the Span- 
ish form, although 9 received the same IQ on both. The largest indi- 
vidual difference in favor of the Spanish form w'as 44 points. It should 
be noted that this was a non-verbal test. Spanish being used only to 
give directions. Had the children been given a piedominantly verbal 
test in Spanish, they might have done just as poorly as they would on 
a verbal test in English. When a child speaks one language at home 
and another at school, his mastery of hoi/i languages may suffer as a 
result. 

This was demonstrated in an investigation conducted on Welsh 
children (4). Two verbal intelligence tests, a word-knowledge test, 
and the Pintner Non Language Scale were given to 10- and 11 -year- 
old children in a Welsh-speaking and in an English-speaking district 
of Wales. School instruction was conducted in English in both areas. 
As in other studies, the bilingual children were found to be superior 
on the non-language scale but inferior on the verbal tests. When Welsh 
forms of the verbal intelligence tests and the word-knowledge tests 
were administered, the inferiority of the bilinguals was even greater 
than it had been on the English forms. In other words, these children 
did even more poorly in their “home language” than they did in their 
“school language.” Such a result is not, to be sure, surprising. If we 
consider the acquisition of vocabulary as an example, it is apparent 
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that when the child speaks one language at home and another at 
school, he will learn a somewhat different set of words in the two 
situations and his vocabulary in each lan^^ua^e will tiiercby be 
curtailed. 

A further point to bear in mind is that bilingualism per se docs not 
necessarily result in language handicap. Under certain circumstances, 
bilingualism may produce no handicap in one or both languages, and 
in such cases we would not expect it to depress intelligence test per- 
formance. For example, Pjiitiier and Arsenian (54) found no relation- 
ship between degree of bilinguali.'.m and scores on a verbal intelligence 
test in a group of 46^) native-born Jewish school children in New 
York City. The correlation was .059 and fell well within the chance 
value. It has been a general tinding, in fact, that Jewish school chil- 
dren, as well as college students, do especially well on verbal tests. 

\ luis the bilingualism of the Jev/ish child is not such as to interfere 
with his mastery of Fnglish. One reason for this may be found in the 
attitude of the Jewish group toward the two languages, as contrasted 
with the attitude of other foreign-language groups. The Jewish child 
in America will eventually have to make his way in an Fnglish-spcak- 
ing society, and English is therefore of prime importance to him. On 
the other hand, those national groups which arc in large part oriented 
towaird the possibility of returning to their country of origin may 
regard English more as a temporary expedient. Another important 
factor is undoubtedly the strong educational tradition in the Jewnsh 
culture and the parental insistence that the child do well in school, 
especially in the relatively abstract and ''bookish” subjects (8; 39, 
p. 174).’ 

Another illustration of the fact tlvit the mere acquisition of a second 
language need not prove to be a handicap is provided by an investiga- 
tion conducted in Ireland ( 59 ). All the children tested were predomi- 
nantly English-speaking at home, but some attended Irish-speaking 
schools and some English-speaking schools, in both types of schools, 
however, the second language was taught as a school subject, i.e., 
English w'as taught as a separate coi.ive in the Irish-speaking schools, 
and Irish was so taught in the English-speaking schools. On perform- 
ance tests, both groups did equally well. On verbal tests, however, the 
children in the Irish-speaking schools excelled those in the English- 
speaking schools when tested in English. When the tests were admin- 
istc^'cd in Irish in the Irish-speaking schools and in English in the 
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English-speaking schools, the children in the English-speaking schools 
obtained higher scores. It should be remembered that all children 
were bilingual to a certain extent in school, because of the course in- 
struction in the second language. What these results actually show is 
that children will do better on an intelligence test administered in the 
language which they speak at home ami study as a school subject than 
they will in a language which is used in school instruction but not 
encountered cKcwhcrc. I he data further suggest that, when childien 
arc bilingual, they master more thoroughly a language which they 
speak at home and learn as a school subject than one which they 
merely speak at home and in school. 

In summary, bilingualism per se need not handicap a child. Bilin- 
gualism as it occurs in a large proportion of the immigrant ponulation, 
however, is such as to reduce the child’s mastery of either language, 
because one language is restricted largely to one set oi situations in 
the child’s life, and the other language restricted to another set. What 
the child needs is to learn to express himself in at least one language 
in all types of situations. It is not the interference of the two lan- 
guages, so much as the restriction in the learning of one or both to 
limited areas, that produces a handicap. 

TABLE 44 Median IQs of Indian School Children 


( \fl ipftti I otn Jamieson m<l S ii lifoij '^7 S fO 



Number of Casts 

Mt than IQ 

National Intelligence Test 

275 

79 8 

P miner Nv:m 1 anguage Test 

280 

96 9 

Pintnei Pateison Performance TtsK 

115 

96 4 

Pinlncr-C'unningham Pnmaiv Test 

Y) 

77 9 


Studies on language handicap have not been limited to European 
national groups. In investigations on American Indians, the influence 
of language deficiency upon intelligence test performance has been 
vividly demonstrated. Jamieson and Sandiford (37) administered a 
series of standard tests to 717 pupils attending Indian schools in 
Ontario, Canada. All the children could speak English, but their 
ability to do so was below that of the average American child. The 
median IQ’s obtained by the Indian children on each test arc shown 
in Table 44. 
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A comparison of the median IQ on the verbal and non-verbal tests 
reveals the influence of language handicap. On the National Intelli- 
gence Test, a predominantly verbal test, the Indian children arc clearly 
below the American norms. On the Pinlner Non-Language and 
Pintner-Paterson Performance Tests, on the other hand, their per- 
formance is practically up to the norms.*^ The low median IQ on the 
Pinlncr-Cunningham Test, administered to the younger children, again 
suggests the role of language handicap. Although non-verbal in con- 
tent, this test requires extensive and detailed instructions given in oral 
English. 

A more conclusive demonstration of the importance of language 
handicap was provided in the same study by a comparison of a group 
of monoglots, who spoke only English, with bilinguals, who spoke an 
Indian language at home all or part of the time, d he median IQ’s of 
these two groups on each lest arc shown in Table 45. It will be noted 


TABLE 45 Median IQ's of Monogiot and Bilin^nia! Indian School 
CJuldrcn 

<A(].i.pU(l {rjn\ laiJUfson ard S.iud foid, 37, pp. 540 51i) 


Test 

Nnnihcr of Cases 

Median }Q | 

Monoglot 

Bilingual 

Monoglot 

Bilingual 

Niitional Intelligence Test 

153 

115 

82 -t 

76.6 

Piiilner Non-Languatze Test 
Pininer-Patcrson Perform- 

152 

121 

100.0 

93.6 

ance Tests 

80 

30 

9T8 

100.0 

Pintncr-C iinningham T csl 

33 

23 

80 5 

68.1 


that on the performance scale tin hilim^md children obtain a higher 
median IQ than the monoglots, whereas the reverse is true on the 
other three tests. This suggests that the poorer showing of the bilin- 
guals is not due to their inferior mental status but to the verbal nature 
of the test. In the case of the Pintner Non-I anguage Test, it is possible 
that the use of paper-and-pencil materials, as well as the dependence 
of some of the sub-tests upon inlo. mation characteristic of our cul- 
ture, gave a disadvantage to the children from the less highly assimi- 
lated homes. Those children who were relatively unfamiliar with such 

The fact that the medians on these two tests still fell below 100 may be 
explicable m teiins of speed oi W'ork, motivation, dillerencci m t’encral information, 
and other factois to be discussed in subsequent sections of ihe present chapter. 





724 Differential Psychology 


materials would also tend more often to come from Indian-speaking 
homes. 

Subsequent investigations on a number ol diiTcrenl Indian groups 
have corroborated the findings of Jamieson and Sandiford. On verbal 
tests, Indian children average consistently below the white norms, but 
on non-language and especially on performance tests they are approx- 
imately equal to white children (3, 28, 32). In an extensive survey 
by Havighursl and Hilkevitch, 670 American Indian children between 
the ages of 6 and 15 were tested with i short form of the Arthur 
Performance Scale (32). The subjects included Hopi, Navajo, Zini, 
Zia, Papago, and Sioux. Although there were large differences among 
various Indian communities, the total average coincided with the white 
norms. A sub-group of 30 children who had also been tested with the 
Kuhlmann-Anderson received a mean IQ of 82.5 on the latter test and 
J02.8 on the performance scale. I he dinercnccs in score among 
communities were shown to be at least in part associated with the 
degree of contact with while culture. For example, two groups belong- 
ing to the same tribe showed a dilTerence in test score which corre- 
sponded to the dillerence in the extent of their contact with white 
culture. 

Similar results have been obtained with Oriental groups in America. 
Darsie (17) tested 570 American-bt'rn Japanese children between 
the ages of 10 and 15. Only. those children w^ho reported that English 
was the language most familiar to them w^ere included in this group. 
The linguistic diiliculties were iherefore not very pronounced, but were 
just such as might he commonly found among the children of immi- 
grants. On die Army lieta, a non-language test, tliere was no con- 
sistent diil'ereiicc in score between Japanese and American children, 
dhe vurceiion of the difference varied from one sub-test to another; 
the tola! secures showed no significant didcrence at ages 10 and I 1, 
be>'ond '.vhich ages the Japanese children excelled. 

Ihe Stanlord-Binet, however, yielded clear-cut differences. The 
median iO of die J jpanese group was 80.5, as contrasted with 09.5 for 
white children of tlic same disliicts. ITat this dilfcrcncc was attrib- 
utable to the verbal nature of the test was definilely demonstrated by 
a special analysis conducted by Darsie. Each individual test on the 
Stanford-Binct scale was ranked for degree of Japanese inferiority. 
The tests v^crc then rated independently by seven psychologists on the 
basis of the degree to which success on each depends upon verbal 
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ability. A final ranking of the tests was obtained by taking the average 
of the ratings by the seven judges. When these two sets of ranks — the 
one for Japanese inferiority and the other for ‘“vcrbality” — were com- 
pared, they were found to correlate -f .87. P\irther corroboration of 
this relationship was furnished by a comparison of the performance of 
Japanese and whiles on the separate tests. Thus the superiority of the 
American children was found to consist chiclly in their greater success 
on the linguistic tests. The Japanese surpassed the whiles, on the other 
hand, in certain non-verbal tests of the Stanford-Binel scale involving 
sustained attention and visual perception. ‘ 

DIlMCUn iKS Oi^ IHST ADMINISTRATION 

Jn addition to language handicap, other special difficulties arc encoun- 
tered in the attempt to administer tests to widely dilTering groups. The 
use of pantomime and gesture in non-language tests is oltcn confusing 
to the subject since it is not his normal mode of communication. This 
is illustrated in certain observations regarding the administration of 
the Army Beta to the Negro draft during World War !. Several exam- 
iners called attention to the fact that it was difficult to keep up their 
subjects' interest in Hie test. In the report from one camp, it wa.-> stated 
tliat 'fit took all the energy and enthusiasm the examiner could muster 
to maintain the necessary attention, as there 'vas a decided disposition 
for the Negroes to lapse into inattention and almost into sleep'’ (64, 
p 705). One of the reasons offered in explanation of this reaction 
was the ariificialily of the situation produced bv the elimination of 
language, it is also difficult to standardize directions given in panto- 
mime and to insure that they she ll always be repeated in identical 
fashion. 

The use of pictures as test materials is also somewhat que.stionable, 
especially in cultures which provide no experience with pictorial rep- 
resei.tation in everyday life. A two-dimensic)nal reproduction of an 
object is not a perfect replica of the original, it simply presents certain 
cues which, through the influence f past experience, lead to the 
perception of the object. If the cues arc highly reduced, as in a simpli- 
fied or schematic drawing, or if the necessary past experience is 
absent, the correct perception may not follow. It might be added that 

The Japanese thikhen \^ci- sigtiifkanily superior in the Indutlion, Papet Cutting, 
Code ixarning, and Lndosed hoxes tests. 
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pictures of objects which are themselves unfamiliar in the cultural 
group to be examined are obviously unsuitable as test materials. They 
have, nevertheless, been included in certain non-language scales which 
have been employed in racial comparisons. 

The so-called culture-free tests, such as the International Group 
Mental Test (19), R. B. Calteirs Culture-Free Intelligence Test (14, 
15), and the Leiter International Pcrlormance Scale (41), make a 
deliberate attempt to include only content which is univei sally famil- 
iar in all cultures. In actual practice, ho^'cver, they still lall short of 
such a goal (cf. 43). In so lar as they employ pictorial representation, 
these tests ma) also tavor certain groups unduly. Mention may like- 
wise be made ol tests designed to measure individual dilTercnccs 
within cultures quite unlike our own, such as the Fiji Test of General 
Ability prepared by Mann (42) Such tests do not, of course, lend 
themseKes to inter-cultural comparisons. 

A further problem arises in connection with rapport and motiva- 
tion. Accepted te Ting ‘practice demands tint the examiner establish 
rapport with his subjects. By this is meant, in gciicr.il, lluit the sub- 
jects should be put at their ease, thei»* interest and cooperation should 
be secured, and they should be m*K!e cairn and ccMiitortable lieforc 
the test is begun. In othci words, it is assumed that each subject m\\ 
be in a condition to do his best. In an individual test a definite efiort 
is usually made to establisJl rappoil with the >ubject. With group 
tests, however this is much more dillicuit. The examinei in su'di a 
case must limit himwdl to a lew reassuring introductoiy remarks and 
to the elimination of any obvious handicaps under which individual 
nieinbers ol the group may be laboring. 

When an cxtiniinei fioin one cultural or lacial gnmp administers a 
test to subjects in a diflercnt group, rapport is even poorer, the situa- 
tion being muLh moie strained and unnatural lor the subjects than 
when thev aie tested by a member of Ihcii own group. Tins is particu- 
larly noticeable in the testing of American Indians and Negroes by a 
white examiner. An interesting illusUation ol the elfect of the 
examiner-subject relationship is to be found in a study by Caiiiidy 
(12), in which 4S Negro and 25 white school childien w'erc given 
the Stanford-Binet by both Negro and white examiners. Some of the 
subjects in each racial group w^cre tested by the white examiner first, 
and some by the Negro examiner fil‘^t. In both white and Negro 
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groups, the mean 10 was about six points higlicr when the subjects 
were tested by an examiner of their own race. 

Quite apart from any racial disparity, the presence of a stranger 
will in itself occasion more emotional disturbance among the mem- 
bers of certain cultures than it would among American city school 
children, who are accustomed to sudden visits fiom a succession of 
supervisors, research workers, psychologists, and others. Furthermore, 
the suspicion and hostility manitested by m.my “primitive" peoples 
toward strangers will necessarily afiect the individuars altitude and 
responsiveness toward a foreign examiner. 

DiriLKLNCTS IN SC UOOI ING 

It is well known that the educational facilities available to the indi- 
vidual vary widely Uom one racial or national gioup to aiunlisr. Ihis 
is apparent even it we consider only the loial duration of schooling. 
In certain rural sections ot the United States, for e sample, the school 
year is drastically shoiteiieJ, sometimes to as little as si\ months The 
inei:nl(int\ of school attendame pre\a1ent in certain groups, such as 
the American Indian, reduces still further the ettective length of time 
devoted to instruction. Fin<ill>, the quail i\ of the available training 
and the conditions under which it is obtained eminoi be ignored. In 
general, it is just in those gioups which receive the least schooling 
that the quality of instruction is poorest The type of education offered 
in riual Negro schoob ot the South, for example, is tar inferior to 
that in the average white yubne school. To equate years ot schooling 
does not eliminate educational dillerenccs between such groups. 

It is now generally recognized that intelligence tests are not inde- 
pendent ol educational b«ickgioiiiK' It wall be recalled tliat such tests 
are often validated against school orogress as a criterion (cf Ch. 2), 
and that they correlate nearly as highly with tests of school achieve- 
ment as they do with other intelligence tests (cf. Ch. 14). Ihis corre- 
lation is even higher within groups whose educational background b’ 
relatively heterogeneous. For cxaairle, in a survey of about 2000 
urban Negro school children in Texas and Oklahoma, a correlation of 
.81 was found between an intelligence test and a test of school achieve- 
ment (27). 

It will be recalled that a high correlation between amount of school- 



728 Differential Psychology 


ing and intelligence test score was found among the soldiers tested in 
both world wars (cf . Ch. 8 ) . A further analysis of Army Alpha scores 
and academic level, for both Negroes and whites tested during World 
War I, is given in Table 46. Within any one group, there is a con- 
sistent rise in median Alpha score with increase in amount of educa- 
tion. That dilTercnces still exist when comparisons are made verti- 
cally, within a single educational class, is attributable to a number ot 
factors. Chief among these arc differences in the quality of education, 
a factor which is ignored in the system ol classiffcalion here employed. 
Diff erences in the socio-economic level of the home as well as in other 
more general conditions may also be mentioned in this connection. 
But it is apparent that the differences in score arc much larger when 
we read across the table than when we read down any one column. 
In other words, dilTerences associated with educational level are of 
a much higher order than differences associated with race or foreign 
birth. 


TABLE 46 Median Army Alpha Score of Men with Di fit tent 
Amounts of Schoi>!in^e 
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draft 21.4 

47.2 

72.4 

91.9 

Northern Negroes 

J7.0 

37.2 

71.2 

90.5 

SoLilhein Negiocs 

7,2 

16,3 

45.7 

63.8 


To be sure, correlation never proves causation. 1 he fact that intelli- 
gence test scores rise with educational level does not in itself tell us 
which IS cause and which effect. It can be argued that the more 
intelligent individual will be more successful in his school work and 
will pursue his education further than the less intelligent. Intellectual 
differences may thus be the cause rather than the effect of educational 
differences. In such a case, persons in the higher educational group- 
ings would represent a more highly selected sampling from the outset. 
This explanation, although partially applicable to individuals within 
a group, appears far-fetched when applied to different racial and 
national groups. When opportunities for continued education or for 
satisfactory instruction at any level are so unlike from one group to 
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another, failure to obtain such education cannot be attributed to in- 
jerior intelligence. 

The effect of educational handicap upon intelligence test perform- 
ance is especially apparent in the American Negro. Since his native 
language is English, the Negro is frequently tested with the common 
verbal type of intelligence test. Because of his pronounced educa- 
tional deffcicncy, however, the Negro has a very limited command of 
the language, as well as serious gaps in other fields of knowledge. 
Both of these factors would seriously alter the interpretation of scores 
on common intelligence tests. 

Of interest in this connection is the finding that 87% of the Negro 
rind 84% ol the white soldiers assigned to the army's Special Training 
Units during World War 11 completed the training successfully (61). 
This course consisted of an intensive educational program designed 
to bring illiterates up to a fourth grade level in reading and arithmetic 
and to give' them a minimuiu degree of proficiency as soldiers. Some 
completed the course in as little as three weeks; others required as 
much as thirteen or sixteen weeks, but the average duration was eight 
weeks. These data do not, of course, have any bearing upon Negro- 
white ditjcreuccs in learning ability, since individuals were selected 
for admission to this program on the basis of promise (22, 62). It 
was principally those who had been deprived of adequate educational 
opportunities who were assigned to the special training. The similarity 
in per cent of successes among Negroes and whites simply means that 
the prognostic indices employed to select individuals were about 
equally effective in both groups. What the results actually show is 
that, through an intensive educational program, large numbers of indi- 
viduals of both races were able to make remarkably rapid progress in 
the type of functions measured by jntelligcnLe tests. 

SOCIO-ICONOMIC LFVTL 

It is apparent that the economic, social, and cultural level of the homes 
of such groups as immigrants, Ne‘n-oes, or American Indians is on 
the whole far below the general American average. One of the first 
investigations designed especially to determine the relative contribu- 
don of socio-economic diff erences and of racial or national differences 
to IQ was conducted by Arlitt (1), I’he Stanford-Binct was admin- 
istered to 191 American children of native-born white parents, 87 
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children of Italian immigrants, and 71 Negro children. All the sub- 
jects were taken from a single school district and all spoke English 
with no apparent difficulty. Each child was classified on a 5-point 
scale on the basis of father’s occupation, which was taken as an 
approximate index of the socio-economic level of the home. 

The median IQ of each group is shown in Tabic 47. When the three 
“racial” groups are compared as a whole, the children in the immi- 
grant and Negro groups fall 21 to 23 points below the group of native- 
born white parentage. The differences in occupational level among 
these three groups were, however, very large. Over 90% of the Ital- 
ians and Negroes fell into the semi-skilled and unskilled categories. 
When only the children in these two occupational levels are included, 
the median IQ of the native white group drops to 92.0. Thus the 
intellectual differences among the three racial groups are reduced to 
a very small quantity when comparisons are restricted to children of 
roughly the same socio-economic level. 


TABLE 47 Socio-Economic versus Natio-Racial Factors in Relation 
to Children\s IQ*s 

(From A. lift, 1, IPI-ISJ?) 


Group 

Median IQ 

Ci roups of 

Tolal Groups Roughly Comparable 

Occupational Level 

Native white parentage 

106.5 

92.0 

Italian parentage 

85.0 

85.0 

Negro 

83.4 

83.4 


It might again be objected that we cannot determine which is cause 
and which is effect in the relationship between intellectual and occu- 
pational level. Since, however, the opportunities for employment in 
higher positions are far from equal for native Americans and immi- 
grants, and this difference is still greater when Negroes are considered, 
it seems unwarranted to attribute the lower occupational status of the 
latter groups to inferior intelligence. 

Several investigations on the relationship between socio-economic 
factors and IQ among Negroes in America have contributed toward a 
clarification of the nature of this relationship. First, it should be noted 
that the differences in intelligence test scores among occupational 
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classes tend to be smaller for Negro than for white children. Thus in 
a survey of third grade Negro school children in Washington, D. C., 
Robinson and Meencs (57) found a 13- to 14-point difference in 
mean IQ between the children of laborers and the children of pro- 
fessional men. Among white children, this difference is generally 
about 20 points (49). Moreover, the mean IQ's do not follow the 
occupational hierarchy so closely among Negro as among white chil- 
dren, but tend rather to fall into a dichotomy, with clerical, business, 
and professional occupations in the upper category, and skilled and 
unskilled labor in the lower (13, 52, 57). It is likely that the socio- 
economic level of Negro homes is less closely related to occupational 
class than is true of white homes. The range or heterogeneity of white 
homes is undoubtedly much greater than that of Negro homes. The 
difference between the remuneration of a Negro in business or pro- 
fessional work and one in the skilled trades is probably much smaller 
than that between whites in the corresponding occupational cate- 
gories. Restricted vocational opportunities would also mean that at 
least some Negroes with sufticient ability and education to hold a 
higher level job might be engaged in low^er level occupations. All 
these conditions would tend to reduce the differences in the IQ’s of 
Negro children from different occupational classes. 

Some corroborative evidence for these interpretations is provided 
by the previously mentioned study of Robinson and Meenes (57). 
The Kuhlmann-Anderson IQ's of 444 third grade Negro children 
attending Washington, D. C., public schools in 1938-39 were com- 
pared with those of 491 ’''ogr > cliildren in the third grade of the same 
schools in 1945-46. In the latter year, when vocational opportunities 
for Negroes were better, a closer correspondence between paternal 
occupation and child’s 10 was found. Moreover, the mean IQ of the 
entire group was higher in 1945 46 than in 1938-39. The latter find- 
ing may also be related to the improved socio-economic status of the 
second group. 

That occupational level may not be as diagnostic of general home 
conditions among Negroes as among whites is likewise indicated by 
the fact that moie direct measurcj of socio-economic level sliow a 
closer relation to Negro children's IQ’s than does parental occupation. 
Using the Sims Score Card, which is based upon a variety of home 
characteristics, CJldham (50) found a consistent and clear-cut rise in 
Negro children’s IQ’s from the lowest to the highest socio-economic 
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/cvels. Similarly, Robinson and Mcenes (57), in ihc previously cited 
study, report high correlations between the mean 10 of children in 
each of the schools in the survey and such factors as the average 
rental or the frequency of radios in the community. All these findings 
suggest that parental occupation is not as valid (;r adequate an index 
of socio-economic background for Negro as for white children. 

The relation between IQ and socio-economic factors tends to rise 
with age in the case of Negro children (60), as it ci(K\s in the case of 
white children (cf. Ch. 23). Ihe interpretation of sucli a rise is, how- 
ever, ambiguous since the tests do not measure the same functions at 
dilTcrent age levels. In a comparison of Negro and white infants in 
Florida in 1031, McGiaw (45) found that the whites e^veelled on the 
Biihler Babytests. The wliite infants, however, were iiho taller and 
heavier than the Negro infants, a physical difference which could have 
resulted from inequalities in prenatal and postnatal care and nutrition. 
That diflercnt samples of ihc same racial group, which are livir'.g under 
dilTercnt physical conditions, may dilTcr in their physical development 
in infancy and childhood has been demonstrated for various gioups, 
including Mexicans (29 ) and Europeans (47). In Chapter 1 1 vve have 
already considered the possible effects of maternal nutrition upon Ivolh 
prenatal and postnatal development. DiiTcrcnces in the rate and level 
of physical development may in turn affect the infant's behavior, 
especially the simple sensopi-motor functions which predominate in 
psychological tests at the infant level. 

A study similar to that of McGraw was conducted in 1946 by 
Pasanianick (51 ) with Negro and white infants in Connecticut. In this 
case the Negro infants did not differ significantly from the whiles in 
cither physical or psychological development. 4 he investigator attrib- 
utes such a finding to the fact that the Negro maternal diet in Ibis 
group had more nearly approached wdiite standards. 

In investigations on American Indians, the relatively^ low socio- 
economic level of die home is an important factor to be considered 
in the evaluation of their test performance. Opportunities for intellec- 
tual development, as well as the general level of material comfort are 
far below the average for American homes. Thus, in the study by 
Jamieson and Sandiford (37), the homes of the Indian children re- 
ceived an average rating of only 13 points on the Chapman Socio- 
Economic Scale, as compared to the white norm of 56 points. A dose 
correspondence between the social status of various Indian groups 
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and their relative standing on the National Intelligenee Test was found 
by Garth (25). 

Not only intelligence, but also emotional adjustment and other per- 
sonality characteristics may be related to socio-economic factors. For 
example, surveys of children’s play activities have shown that Negro 
children tend to engage in group play relatively more often than 
white children (40). Bciore attributing such a finding to a greater 
“sociability” of the Negro race, it is well to consider that crowded 
housing and inadequate facilities for many other types of play may 
account for part or all of such a dillercncc. 

Jn a personality inventory survey of 1647 children between the 
ages of 9 and 15, Brown (7) compared the scores of several sub- 
gioups, including: urban and rural; low, middle, and high socio- 
economic levels; and Jewish. Slovak, and native American non- 
Jewish. Adjustment scores proved to be much more closely related 
to socio-economic level than to either “racial” group or urban-rural 
residence. Statistically significant dilTerences in such scores were found 
between the different socio-economic groups, but the dilTerences 
among the other Opes of groups investigated were conM^tsntl} small 
and insignificant. Similarly, in a survey of abc'ut 60. 000 selectees 
examined at one induction station during World War JI, the rate of 
rejection for various mental disorders varied with population density 
and with socio-economic rating of the community (35, 36). 1 he rate 
also differed among various national and racial grou]')s, but those 
natio-racial groups with the highest rcjectum rate came from com- 
munities with the highest population density and lowest socio- 
economic level ( 34 ) . 

TRADITIONS AND COSTOIvIS 

The particular culture in which the individual is reared may influence 
his behavioral development in many ways. Ihe operation of environ- 
mental forces is not limited to the extent and quality of educational 
opportunities available in the school, the home, and the neighbor- 
hood. The question is not only one of amount, but of kind. The expe- 
riences of people living in different cultures may vary in such a way 
as to lead to basically different perceptual responses, lend a different 
meaning to their actions, stimulate the dcvelc^pment of totally diflerent 
interests, and furnish diverse ideals and standards of behavior. 
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The importance of motivation and interest in intelligence test per- 
formance has been repeatedly emphasized. Yet it is apparent that 
many of the tests in current use cannot arouse the same emotional 
reaction in other cultures as they do in our own. Thus for an American 
school child the average intelligence test bears a close resemblance 
to his everyday school work, which is probably the most serious busi- 
ness of his life at the time. He is therefore easily spurred on to exert 
his best efforts and to try to excel his fellows. For an Indian child, on 
the other hand, the same test cannot have such a significance. This 
type of activity has no place in the traditional behavior of his family 
or tribe. Similarly, many investigators have noted that among Negro 
children interest in intelligence tests is not as keen as among while 
children, and that the former seem not to be as strongly motivated as 
the latter. 

Such differences in motivation are not necessarily limited to test 
situations, but may exert a broader influence upon achievement in 
school and in other everyday life situations. Several theories have been 
proposed, for example, regarding the reaction of the American Negro 
and other minority groups to socially^ imposed frustrations. Dollard 
(20) and Maslow and Mittelman (44) have suggested that the Negro 
may assume an attitude of stupidity and lethargy as a dcfeitse mech- 
anism against frustration and oppressiim. According to these writers, 
such an attitude would prov'de a sort of revenge and enable the indi- 
vidual to avoid disagreeable responsibilities. Similarly, Brown (10) 
has argued that the linguistic development of the Negro may be 
hindered by social pressures which inhibit verbalization. Inarticulate- 
ness reduces the possibility of incurring the hostility of the dominant 
social group, and might thus be '"cultivated” as a measure of discretion. 

In addition to emotional and motivational factors, specific local 
manners and stK'ial usai^cs may inlluencc the subject’s performance 
on a psycliological test. Several striking examples of such traditional 
behavior have been reported. Thus Porteus (55), in administering 
performance tests to Australian aborigines, found it difficult to con- 
vince his subjects that they were to solve the problems individually and 
without assistance. In explanation of this behavior, he writes: 

. . . the aborigine is used to concerted thinking. Not only is every 
problem in tribal life debated and settled by the council oi elders, but it 
is always discussed until a unanimous decision is reached. On many occa- 
sions the subject of a test was evidently extremely puzzled by the fact 
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that I would render him no assistance, especially when, as happened in the 
centre, 1 was testing some men who were reputedly my tribal brothers- 
This was a matter which caused considerable delay as, again and again, 
the subject \\oijld pause for appioval oi assistance in the task ( p. '^08) . 

Similarly, Klincbcrg (39) reports that among the Dakota Indians 
it is considered bad lorni to answer a question in the presence ot some- 
one else who docs not know the answer. This creates a particularly 
dilhcult situation in school, where the teacher'- find it dilficult to induce 
the children to recite in class. In the same group, custom ioibids one 
to answ'cr a question unless he K absolutely sure of the answer. The 
elTcct w'hich this would ha\c upon intelligence tests, in which the sub- 
icet is advised to "'guess” when not sure and is urged to “try his best” 
on a diilicult probLm, can be readily loresccn. The child who reiuscs 
to give any answer unless he is certain its correctness will lose 
many points which he might ha\c earned through partial credits and 
chance successes. 

Another medium through which cultural background m<iy influence 
test perform, ince is to be lound in the special ass(>(ia/i()ns and mean- 
in^s which ha\c besii built up by social conditioning. In one ot the 
sub-tests of the National intelligence lest, the child is required to 
underline the two words which tell what the given item always has. 
One of the examples in thi'. test reads: 

C lowd (closeness, danger, dust, excitement, nunibLi) ’ 

Although “closeness” and “number” are gi\cn in the key as the correct 
answers, it was found th„ aiiivjng PI. in> Indians “dangei” and “dust 
or c\en “cxciicment,” wuic Ircquently undeiscorcd. dhc experience 
which these children liad had witt crowds on the prairie had taught 
them that these wcic nccessiuy alt ihutes of a crowd ( *f. 23). 

Many other instances ot such culturally determined associations can 
be found in intelligence test performance (cf., c.g., 39). In one of the 
tests ol tlic Army Alpha Form 6, occurs the question, “Why should 
all piiicnts be made to send their children to schooF” Of the several 
alternative answers given, the “co'i x t” one is that ‘Nchool prcpaies 
the child for his later life.” But thi> is not true for the Indian child, 
whose schooling often unfits him for life on the reservation. Similarly, 
in a sentence completion test of the National Intelligence Scale is 
found the statement, “ should prevail in churches and libraries.” 

Scale 4, torn 1 Test Fxcrcise 17 



736 Differential Psychology 


The word to be inserted in this case is “silence.” Among Negro chil- 
dren, however, this problem would be complicated by the fact that 
their own churches are seldom silent. Noise is not only common in 
their houses of worship but is frequently an integral and essential part 
of the ritual. 

A further example of llic inapplicability of a psychological test to 
groups dilTcring In^ii the one upon which it was standardized is fur- 
nished by an incident which occurred m the testing of children in the 
Kentucky mountains.^’ The following is one of the problems in the 
Stanlord-Bincl Scale: “If you went to the store and bought 6 cents’ 
worth of candy and gave the clerk 10 cents, wdiat change would you 
receive?” One alert young boy, upon being asked this question, 
replied, “1 never had 10 cents, and if I had I wouldn't spend it for 
candv, and anyway candy is what your mother makes.” Still wishing 
vO find out if the child could subtract 6 from 10, the examiner re- 
formulated the problem as follows: “If you had taken 10 cows to pas- 
ture lor your father and 6 of them strayed away, how many would 
you have left to drive home?" The child now replied promptly, “We 
don't have 10 cows, but if we did and I lost 6, I wouidn't dare to go 
home.” rite examiner tried once more with tlic follow ing^inquiry: “If 
there were 10 children in a school and 6 of them were out with the 
measles, how many would there be in school?” This answer came 
even more promptly: “None, because the rest would be afraid of 
catching it too.” 

linally, mention should be made of the important role of speed in 
nearly all intelligence tests and of the widely varying empluisls placed 
upon speed in dilTcrent cultures. An investigation by Klineberg (38) 
on Indian, Negro, and wiiite school boys illustrates the operation of 
this factor. Several of the tests in the Pintner-Paterson Performance 
Scale W'cre administered to the following groups: 

1.3t) Indums iiticndmg Haskell Instmitc in Kansas 
IZO Indians at the Yakima Reservation in Washington 
107 whites in rural Washington, near the Indian reservation 
139 Negroes in a rural disinct of West Virginia 
25 whiles in the same district of West Virginia 
200 Negroes in New York City 
100 whites in New York City 

'"This incident is reported in Piessey, S. L., Pwchohyiy and the New Education, 
N. Y.: JJaipei, 1933. Pp. 237-238. 



Racial Comparisons: Problems of Measurement 


737 


In accuracy of performance, as measured by the number of errors 
on each test, the Indians excelled the whites, and the Negroes were 
either equal or slightly superior to the whites. All measures of speed, 
on the other hand, favored the whites. A comparison of groups of the 
same race hut living in (liffercnt environments suggested that these 
difierences in speed were cultural rather than biological. Thus the 
New York City Negroes clearly excelled the West Virginia Negroes in 
every comparison. Similarly, the Haskell Institute Indians were con- 
sistently faster than those tested on the Yakima Reservation. A fur- 
ther division of the Haskell group into those who had previously lived 
on a reservation and those who had lived among whiles in a town or 
city showed the latter to excel in speed. 

In explanation of these results, Klinebcrg calls attention to the rela- 
tively insignilieant part which speed plays in the lil'e of the reservation 
Indian or the rural southern Negro. Most observers are impressed 
with the IndianN almost Cinnpletc lack of concern with speed, l ime 
means nothing in the daily activities of the Indian. He can see no 
reason for hurrying through a task, especially il he finds it congenial 
and interesting. Thus in so far as the examiner arouses the child’s 
interest in the test, he makes the necessity of speeding appear evon 
more absurd. At Haskcli Institute, on the other hand, time is much 
more important than on the reservation. The students are constantly 
kept busy with a variety of tasks and the entire day is carefully sched- 
uled. The white teachers, too, foster the attitude that it is desirable 
to finish tilings as quickly as possible. Similarly, the New York City 
Negroes have been expi ‘d to the hustle of life in a big metropolis, 
whereas the rural Negroes are adapted to a much sIowxt tempo of 
activity. 


TUh C RITERION OF “IMTXLK TOM. SUPMUORITY’' 

In all group comparisons, tlicrc is a tendency to go beyond the ac- 
tually observed dijjercmcs in behavior au l to evaluate the relative 
status of each group in terms of s^mie presumaoly universal critcriem. 
Linear comparisons are made in Urms of better or worse. Thus we 
frequently find national or racial groups arranged in a rank-order for 
‘Intelligence.” One group is said to be “superior,” another “inferior” 
in its mentality. Such a point of view implies either that one group 
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is consistently poorer than another in all intellectual traits, or that 
certain behavioral processes are universally more significant, more 
valuable, or even more “intellectual” than others. 

Specificity of Group Differences. In regard to the first of these 
assumptions, it can easily be shov^m that racial or national groups 
vary in the relative inferority or superiority which they manifest in 
different trails. Group differences arc specific, not general. Thus Japa- 
nese children have been found to excel American children signifi- 
cantly in tests involving sustained attention, visual perception, or 
spatial orientation, while falling behind on verbal or arithmetic tests. 
This was demonstrated in Dardc’s study (17) by the relatively su- 
perior pcrlormaucc of the Japanese children on four of the Slanford- 
Binct tests, viz.. Induction, Paper Cutting, Enclosed Boxes, and Code 
Learning, as well as on the Digit-Symbol Learning and the Number 
Comparisons tests of Ihe Army Beta. A slight supiiiority was also 
shown by these children in the Cube Analysis and Geometric Con- 
struction tests of the Beta wScale. 

Jhc relative standing of American Indian childien on performance 
and on verbal tests has already been discusser! in connection with 
language handicap. It will be lecalled that on performance tests In- 
dian children usually average about as high as white children. Several 
studies with the Goodenough Draw-a-Man Test have shown that 
Indian children score e\en better on this test than on most perform- 
ance scales (18. 31, 58). A number of Indian groups have obtained 
higher average IQ’s than white groups on the Goodenough test. In a 
study ot boys and girls from a Hopi Indian school, Dennis (18) 
reports an interesting sex difference on this test, which appears to be 
related to cultural factors within this group. The girls received a mean 
IQ ot 99.5, the boys 116.6. Dennis attributes this sex difference to 
the fact that in the Hopi culture graphic art is traditionally a mas- 
culine concern, and consequently the boys develop more interest in 
art and have more practice in it than the girls. A similar sex difference 
in Goodenough score has been observed in other Indian communities 
which foster such a traditional sex distinction in artistic pursuits ). 

Differences in specific traits have likewise been found in compari- 
sons among European immigrant groups in this country. Jewish 
children, for example, usually excel on verbal tests and fall behind 
in problems dealing with concrete objects and spatial relations. In a 
study conducted with kindergarten children in the Minneapolis public 
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schools, the Stanford-Binct was administered to groups of Jewish and 
Scandinavian children equated in age, sex ratio, and socio-economic 
status (9) The Jewish children were found to be super ior on tests 
based upon general information and verbal compreliension, while the 
Scandinavian cliildren excelled on tests requiring spatial onentation 
and senson-motor coordination. Similarly, in an analysis of the ^CE 
scores ot Jewish and non-Jewish freshmen at the University of P-Us- 
burgh, the Jewish bo\s were found to csecl the non-Jewish on the 
^mgul^tic part ol the lest (33). "I he Jewish boys also did relatively 
better on the hnguiaie than on the quantitati\e parts ol the test, while 
the reverse was true ol the non-Jew ish bo}N. Surveys of American- 
born clnldren ol Italian immigrants have generally shown that the 
children do relatively well on perfoimance tests and relatively poor’y 
when examined with abstract or linguistic materials (6). 

Such dillerenees in intelhgenee test pcrloimancc among various 
immigrant groups may, of course, he accounted for partly on the 
basis ot the dillerential language handicaps discussed m an earlier 
section rultuial liaditums, however, undoubtediv ])la> a ni qor pait 
in producing these group dillerenees in intellectual development. In 
Jewish lamihes, there is a eharaeteiuticaH'/ mriiked emphasis upon 
the formal aspects (^f education and upon ‘hibslract'" intelligence, to 
the almost total neglect of '‘mechanicar’ intelligence and manual dex- 
teritv. Italians, on tlie othei hand, have traditional and age-old 
admiration for rnanipulaiive arts and cralts. The skill evhibitcd in 
the production of a beaiU’ful obicet, a complex object, or an object 
wxdl adajMed to its juacti^al use is held in high esteem and cncour 
aged from carl> childhood. Relatively little emphasis, however, is 
placed upon the moic abstract l\| s of tiilcnt. 

f he specificity ol intei-group dii.erencc’s in behavior was recognized 
by Porteus on the basis of his extensi'c observation and testing of 
vanous tacial croups in Hawaii. Australia, and Africa (56). For 
exainj'i’e, he repoils that the ( ninese groups which he surveyed ex- 
celled the Jajaanese in tests of the Biiu't t\pc and in auditor} memory 
span. But the Japanese excelled '1 ' Chinese in the Porteus Maze 
Tests and in all performance and mechanical aptitude tests. Similarly, 
Australian aboriginals srnied relatively well on the Porteus Mazes, 
but tell below the Aliican and Asiatic groups in any test depending 
upon speed. Although inclining towarrd a hereditarian interpretation 
of race differences, Poitcus concludes: “Among the racial groups 
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mental development is not even. Advantages in certain tests arc bal- 
anced very often by weaknesses in another” (56, p. 74). 

CuUurai Hierarchy of Behavior Functions. It might be suggested 
that racial or national groups could be arranged in a consistent hier- 
archy if we considered only the ‘'higher mental processes.” 1 ests oi 
abstract abilities, for example, are usually considered to be more 
diagnostic of “intelligence" than those dealing with the manipulation 
of concrete objects or with the perception of spatial relationships. 
The aptitude for dealing with symbolica’ materials, especially of a 
verbal or numerical nature, is likewise regarded as the acme of intel- 
lectual attainment. The “primitive” man's skill in respemding to very 
slight sensory cues, his talents in the construction ot objects, or the 
powers oi sustained attention and muscular control which he may 
display in his hunting behavior are regarded as interesting antlu'opo- 
logical curios which have, however, little or no intellectual worth. 
As a result, such activities have not usually been incornorated in 
intelligence scales, but have been relegated to a relatively minor 
position in mental testing. 

Upon closer analysis it will beccunc apparent that this c(meeption 
of intellect is itself culturally conditioned. By “higher mental proc- 
cs.'.cs” is usually meant those aspects or segments oi behavior which 
are at a premium in our society. Intelligence tests would be very 
ditferent if they had been i^onslructed among American India.ns or 
Australian aborigines rather than in American cities. The criterion 
employed in validating intelligence tests has nearly always been suc- 
cess in our social system. Scores on the test aie correlated with school 
achievement oi perhaps with some more general measure of success 
in our society. li such corielations are high, it is concluded that the 
test is a good measure of “intelligence." The age eritericni is based 
on the same principle. If scores on a given test show a progressive 
increase with age, it may simply mean that the test is iiKasiuing those 
traits w'hieh our culture imparls to the individual. The older the sub- 
ject, the more opporlunhy he will have had, in general, to acquire 
such aptitudes. 

Thus it would seem that our intelligence tests measure only the 
ability to succeed in our particular culture. Each culture, partly 
through the physical conditions of its environment and partly through 
social tradition, “selects” certain activities as the most significant. 
These it encourages and stimulates; others it neglects or definitely 
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suppresses. The relative standing of dilTcrcnl cultural groups in “in- 
telligence” is a function of the traits included under the concept of 
intelligence, or, to slate the same point dilTerently, it is a function of 
the particular culture in which the test was constructed. 

Since the current intelligence tests are a characteristic American 
development and since the testing of racial groups has been et>nducted 
largely by American psvehologists, inter-group comparisons have gen- 
erally been made with tests standardized within our culture. On such 
tests it is not surprising that most comparisons favor American subjects. 
Wiiat would happen if a test were constructed in a dilTercnt culture 
by a procedure analogous to that followed in the preparation of our 
own tests? i he instances in which this has been a.ttemptcd are rare, 
but the results are enlightening. 

In the course of an investigation by Klinebcrg (cf. 39) among the 
Dakota Indians, a "beadwork tesk’ was devised in whi.:h a small 
sample of beadwork was shown to the subjects for four minutes; 
the sample was then removed and the subjects asked to reproduce it 
from mcmeny on a loom. The test was applied to both white and 
Indian giris, all ol whom were first taught liow' to do beadwork on a 
loc'm. As would be expected Irom their greater familiarity wath this 
type of material, the Indian girls clearly surpassed the whites. Simi- 
lar!}, F. D. DuBois (21) standardized a Draw a Horse lest on 
Indian children, following closely the procedure of the Goodenough 
Draw-a-Man lest In lerm.> oI age- grade placement and other 
criteria of "inlc!ligencc,” the hv»rse test proved to be more valid than 
the man test for these hildren. Morcvwcr, when both tests were 
administered to white and Indian children, the whites excelled on the 
man-drawing and the Indiana on the horse-drawing test. On the basis 
of the latter test, the 1 1-vear old white boys lestcd in this study would 
have obt. lined an average 10” of 74! 

Fortcus (5h ) tried a similar experiment while working among the 
Australian aborigines. Having been impressed with the remarkable 
tiaeking skill ol these people, he construeiLd a test with photographs 
of footprints, the task being to ^M'Uch the two prints made by the 
same foot. On this test, the Austivuians did practically as vveil as a 
group of 120 white high school students in Hawaii who were tested 
for comparison. In commenting upon these results, Porteus remarks: 

Allowing lor their unfamiliarity with photographs vve may say, then, 
that with test nialerial with w'hieh they are familiar the aborigines' ability 
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Racial versus 
Cultural Differences 

It is apparent that the cornparison of either the everyday achieve- 
ments or the psychological lest scores of different racial groups cannot 
in itself provide valid informatiori regarding race differences in abili- 
ties or personality. We have seen that because of large group dilTer- 
erices in many important aspects of the psychological environment, 
tests designed for one group may not have the same diagnostic 
significance when applied to another group. For example, an intel- 
ligence test standardized on white American urban school children 
is a valid measure of intelligence only for white American urban 
school children. Even if our only aim is to predict hgw well the 
individual will progress academically within the while American urban 
culture, this test could not adequately make such a prediction when 
given to a child reared under different conditions. An 10 of 60 on 
such a test would not have the same meaning or diagnostic signifi- 
cance when obtained by individuals with dilferent environmental back- 
grounds. riuis if such an IQ were obtained by a white child of 
professional parents, it might indicate some structural deficiency 
which prevents normal intellectual development. But if the same IQ 
were obtained by a Negro child from a poor and isolated rural com- 
munity, it might simply mean illiteracy. The prognosis for intellectual 
improvement in these two cases, given equal subsequent educational 
opportunities, would certainly be quite different. 

To the extent that any two groups differ in both racial (biological) 
and cultural (environmental) factors, the mere comparison of their 
test performance yields ambiguous results. Such results not only fail 
to answer the theoretical question regarding the presence or absence 
of racial differences as such, but they arc also of questionable value 
for the practical problem of diagnosis and prediction. Such predic- 
tions could be validly made only on the assumption that environ- 
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mental group differences are frozen or unchanging, a condition which 
is not at all characteristic of modern civilization. 

In an effort to devise more fruitful experimental designs for the 
study of race differences in psychological functions, a number of 
investigators have combined intra-racial with inter-racial compari- 
sons. Such an approach, although still falling short of a rigid control 
of conditions, permits a somewhat better analysis of group differences 
than is possible by a simple comparison of racial groups as a whole. 
Under this category may be included psychological studies of hybrid 
groups, in which the degree of race mixture is compared with psycho- 
logical test scores. Investigations of regional differences and migra- 
tion likewise permit comparisons among groups of the same race 
living under different environmental conditions. An especially prom- 
ising procedure is represented by cross-comparisons among racial 
and national gi\jups. Such comparisons can be made in many Euro- 
pean nations, whose populations are composed of several subdivisions 
of the Caucasian race (cf. Ch. 20). The representatives of these 
racial sub-groups living within a single nation arc, in general, exposed 
to relatively uniform cultural conditions. To a greater or less extent, 
they share the social traditions and customs of their country. wSince 
all arc members of the ''while'’ race, social distinctions and discrimi- 
nations are far less prevalent than is the case with Negroid or Mon- 
goloid groups living among Caucasians. 

Some investigations have been especially concerned with racial 
versus cnJiural factors in the dexelopnu nt of personality. In such 
studies, cross-comparisoir have also been made among racial and 
cultural groups, the latter groupings sometimes being represented by 
nationality and sometimes by mor- homogeneous and more narrowly 
defined cultural units An interest. ng illustnUion of the relative role 
of cultural and biological factors in the development of characteristic 
group behavior Is provided by studies on gesture and cultural as- 
similation. In the sections which follow, we shall consider typical 
investigations in each of the areas cited ab )\e. 

PSYC nOI.OGIC At. STUDIES OE HYHRID GROUPS 

It has been argued that if one racial group is “biologically superior” 
to another, a mixture of the two should produce individuals who are 
intermediate between the superior and inferior stocks. Moreover, 
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the greater the contribution of the superior race to the individuars 
heredity, the higher should be his intellectual level, according to such 
a view. It should be noted, however, that this problem, too, has its 
complications. In the first place, race mixture is often selective. This 
is especially true of those mixtures which are discouraged or frowned 
upon by society. In such cases, miscegenation may be confined largely 
to the socially and educationally inferior members of both groups. 
More often, perhaps, the selection occurs only in the group which 
is socially dominant, there being rekuively little prejudice against 
such a mixture among members of the socially less favored group. 
This was doubtlessly the case when a ‘"civilized” and a “primitive” 
group first came into contact. It has also been suggested (cf. 47) 
that a certain amount of selection may occur in the reverse direction, 
the superior individuals of the “lower” race being chosen more often 
for such unions. It is doubtful, however, whether such a selection is 
made on an intellectual basis to a significant degree. 

A second important consideration is that the hybrid individual 
is usually more highly assimilated to the culture of the socially domi- 
nant group than is the full-blood. Because ol the prevailing beliefs 
regarding the relative status of the two races, he is usualfy considered 
to be more capable than his full-blooded cousins. As a result, he is 
given better educational opportunities, admitted to more responsible 
positions, and afforded suiicrior facilities for advancement in every 
way. A third and related point is the lad that mixed- bloods have 
been exposed more directly to the manners and customs of the domi- 
nant group than have the full-bloods. Whether the miscegenation 
occurs through legal marriages or illicit unions, the presence of the 
white parent will on the whole tend to bring about a closer contact 
with the white culture than is the case in families where no such mix- 
ture has occurred. As a result, the economic and the social level of 
the home, as well as the degree to which English is spoken at home, 
frequently dilferentiate hybrid from full-blood groups. This is par- 
ticularly true of American Indian groups, which vary widely in their 
degree of assimilation to the white culture. 

AH these factors must be kept in mind in interpreting the findings 
of studies on hybrid groups. A few scattered investigations have been 
conducted on groups of mixed racial origin in Haw'aii ^46), Jamaica 
(8), Africa (14), and elsewhere. The most extensive data, however, 
have been collected on the American Indian and the American Negro, 
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owing to the relative accessibility of these groups in large num- 
bers. 

Hunter and Sommermier (26) administered the Otis Group In- 
telligence Test to 711 American Indians trom a large number of 
different tribes, who were attending the Haskell Indian Institute at 
Lawrence, Kansas. The subjects ranged in age from 14 years upwards. 
Extent of race mixture was determined directly by an examination 
of ancestry records. No evidence of mixture with any race other than 
the white was found. The full-blood Indians formed the largest group, 
numbering 265; only 7 members of the entire tested sampling had 
less than 14 Indian blood. As would be expected from the verbal 
nature of the test, the average Otis score of the Haskell group as a 
W'hole was much louer than the white norms, age by age. Analysis 
of performance on the separate parts of the test showed the Indians 
to be most inferior lU the more highly verbal te>ts, such a: analogies, 
opposites, matching proverbs, and narrative completion. With refer- 
ence to race mixture, a correlation of .41 was found between total 
Otis score and degree of while blood, w'ilhiii the entire Indian group. 

In a later study, Garth and his co-w/orkers (20) analyzed the 
National Intelligence 1’est scores of 609 mixed-blood and (S9 full- 
blood Indians attending Indian reservation schools in South Dakota, 
Oklahoma, New Mexico, and Colorado. A group of 67 white chil- 
dren w^as also tested for comparative purposes. The intelligence test 
scores again showed a steady rise with decrease of Indian blood, the 
averages for -bloods, *4-bloods, and ^.4 -bloods being 74, 15. and 

TABLE 48 Con ehiiion hr/nuen of While Blood and National 

/ntellii'cnce I esi Scenes within Lack School Grade 

n t . Mil ( . ■! til r' ■/ , -1 ' 


C Hide 

Nam her oj C cimh 

Co^i (‘iaiioii 

1 OLirlh 

134 

.70 

Fifth 

160 

,76 

.Sixth 

180 


Seventh 

1 12 

.23 

Eighth 

.5 

.24 


77.5, respectively. Tlie correlation between degree of white blood and 
test score proved to be -| .42. Tiie data were further analyzed in 
respect to separate school grades. In 1 able 48 will be found the num- 
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ber of cases in each grade as well as the correlation between Na- 
tional Intelligence Test score and degree of white blood within that 
grade. It should be noted that the distribution of while blood was 
similar in all grades and could not therefore account for the differ- 
ences obtained. 

These data suggest rather strongly an environmental interpretation 
of the correlation between degree of white blood and intelligence test 
perfonnancc. In the lower grades, those children with a larger per- 
centage of white blood clearly excelled their fellows. In the three 
upper grades, however, the relationship is very low and barely signifi- 
cant. Thus continued education in a common school seems to reduce 
and even wipe out the apparent relationships with degree of Indian 
blood. 

Klincbcrg (30) reports an absence of linear relation between de- 
gree of Indian blood and test performance in a group of 100 Yakima 
Indians in the state of Washington. The tests were taken from the 
Pintner-Paterson Performance Seale and were largely dependent upon 
speed. The Indians as a whole obtained lower scores than <i group of 
100 white boys who had been similarh tested. (Comparison of full- 
blood and mixed-blood groups, however, gave conflietinf results, the 
poorest scores having been obtained by those subjects with the most 
and those with the least Indian blood. 

More recently, Telford ( 52 ) investigated specifically the effect of the 
cultural eonlenl ol the test upon the performance of h}brids. The sub- 
jects wa^re students at various Indian schools in North Dakota and Mon- 
tana. The tests included two scholastic achievement tests, two gioup 
intelligence tests (Otis and Kuhlmann- Xnderson), the Peterson Ra- 
tional Learning Test (a non-veibal test), six performance tests from 
the Pintner-Paterson series, and the Goodenough Draw-a-Man T est. 
The results support the hypothesis that the superiority of mixed- over 
full-blood Indians, reported in some of the earlier investigations, is 
due to the greater familiarity of the mixed-bloods with English and 
with information based upon the white culture. In the achievement 
tests, Telford found the mixed-bloods superior to the full-bloods. A 
smaller but still significant difference in favor of the mixed-bloods was 
obtained on the intelligence tests. On the Peterson Rational Learning 
Test the two groups were equal, and on the performam'e and Good- 
enough tests the full-bloods showed a small, insignificant superiority. 

Further evidence for the cultural hypothesis is provided by a study 
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conducted among the Osage Indians by Rohrcr (49). Osage children 
were chosen for this study because they are the most nearly com- 
parable to white children in socio-economic level, use of the English 
language, and schooling. Rapport and motivation in the testing situa- 
tion are described as having been equally good among the Indian and 
white children tested. All the Indian children were attending cither 
public schools or tuition schools, and were compared with white chil- 
dren attending the same schools. The proportion of Indian blood for 
each child, as determined by ancestiy records, ranged from to 
100%. On both the Goodenough Draw-a-Man test and the Otis test, 
no difference was found between groups differing in per cent of 
Indian blood. Moreover, the Indian children as a group did not 
differ significantly from the while controls, nor lioin the w^hite norms, 
in either test. The number of cases tested, mean scores, and correla- 
tions with degree ol Indian blood arc given in Table 49. 


TABLE 49 Test Scoics of Osai*c Indian Child/ en in Relation to 
Degree of Indian Blood 




i 1 It 1 tn 




Test 

Number 

Indian 

of C USi S 

W lute 

Mian JQ 

Indian White 

C ottelaUoti 
w ith Pcfiree 
of Indian Blood 

Goodenough 

Draw-a-Man 

125 

125 

103.80 

102 92 

.01 

OlJs S-A: 
Intermediate 

110 

110 

100.05 

98 05 

002 


In investigations on the Atneri an Negro, ancestry records have 
not generally been available, so that degree of race mixture has had to 
be determined more indirectly on the basis of physical chaiacteristics. 
In an early study on 907 Negro school children in three Virginia 
cities, Ferguson (13) reported a steady rise in psychological test 
performance with increasing proportion of white blood. Four simple 
psychological tests were administered: analogies, sentence completion, 
A-cancellation, and stylus maze. No anthropometric measures of 
racial characteristics were taken, the subjects being classified by 
inspection into four groups on the basis of skin color, hair color, 
and shape of head and face. 
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In a later study, Peterson and Lanier (44) administered a num- 
ber of “ingenuity” tests as well as intelligence scales to 12-year-old 
Negro school children in Nashville, Chicago, and New York City. 
Several of the tests were non-verbal, an important consideration in 
the comparison of racial groups with diverse educational opportuni- 
ties. Ratings of skin color on a 7-point scale were obtained on the 
Nashville and Chicago groups. In Table 50 are shown the correlations 
between lightness of skin and scores on the five tests employed. 

TABLE 50 Correlations hetwen Lightness of Skin and Test Scores 
of 117 Negro School Children 


l*ftcMM)n ai'<l I.amci, H. p. 


Test 

N uni her of Cases' 

Con elation 

Billet (jroup Test 

83 

.18 

Myers Meie.it) Measure 

75 

.30 

Ralionit) Lciirning, Time Score 

117 

.05 

MLiiliil Vlj/e, Time .Score 

113 

.14 

Dj.sc fran^sfci, lumc Scoie 


.39 


In view of the inadecjuacy of skin color as a criterion of race, 
more extensive measures were obtained on the group of 75 New 
York City subjects. Correlations were computed between score on the 
Yerkes Revision of the Binet Intelligence Scale and each of the tour 
physical trails which were found to diflerentiate most clearly between 
white and Negro subjects. 7’hcse correlations are shown in Table 51 
below. As will be seen, the correlations arc all too low to indicate 
a significant degree of lelationship. 


TABLE 51 Correlations between Intelligence Test Scores and 

Anthroponictnc \1e(isiires on 75 Negro School Boys in New York City 


M'lom iM'.rson aufl 14, p. yO) 


Measure 

Conflation 

Nose 

-.11 

Lip thickness 

.07 

Tar height 

-.15 

InhTpupillary span 

.01 

Composite of these four traits 

-.13 
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Klineberg (30), in the previously described investigation with the 
Pintncr-Patcrson Scale, also tested 139 Negro boys between the ages 
of 7 and 16 in rural sections of West Virginia. The correlations^ 
between intelligence test score and three anthropometric measures 
indicative of degree ol Negro blood are given below: 


Nose width 

.083 

1-ip thickness 

.068 

Black pipmentalion 

.12 


As in the study of Peterson and Lanier, the relationship between 
test performance and index of Negio blood is negligible when objee- 
live anthropometric measures of race mixture are employed. In a 
group of 115 Negro men students .it Howard University, Herskovits 
(24) likewise found no signiticant correlations between inteiligence 
test scores and the same three anthn^pomclrie measures ol Negroid 
eharacicrislies. 

Also relevant to the queslion of race mixture are the data col- 
lected on gifted Negro children. Witty and his studenls (27, 2S, 29, 
53, 54, 55 ) have repoited a number of test surveys, ease stiuiies, and 
follow-ups of Negro children whose IO\ ranged from 120 to 200. 
in one such sur\cy of published studies dealing with intellectually 
superior Negro childien, Jenkins (29) assembled case records of 18 
Negro children wlio tested above 10 160 on the Stanford-Binet. It 
miglit be noted parenthetically that the results of all these studies arc 
closely similar to those ohtainetl by Tcrman and others on gifted 
wliitc children (ef. Ch. 17). Inteiiectualiy superior Negro children, 
like white children of currespouding SO, tend to excel in height, 
weight, and general physical development, they are on the whole 
superior in character and person, ity; and their parents have more 
than average education and tend to cluster in the higher occupational 
levels. Lor the present purpose, how^ever, it is the racial background 
of such gifted Negro children that is of special concern. 

In general, the distribution of white and Negro blood in such 
intellectually superior groups is no dificrern from that in the general 
American Negro population. In 'u * survey by Witty and Jenkins 
(54), 63 Negro school children with JO’s of 125 or higher were 
classified into four categories ol race mixture on the basis of genea- 
iogical data secured from the parents. In 1'abic 52 wall be found the 
percentage of children failing into each of these categories for the 

’ The influence of ape was ruled out by the partial correlation technique. 
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entire group of 63, as well as for a sub-group of 28 with IQ’s of 140 
or higher. For comparative purposes, the corresponding percentages 
for the general Negro population are also given. It will be seen that 
there is no consistent tendency for the proportion of white blood to 
be greater in the gifted groups than in the general Negro population. 
It is also interesting to note that the highest IQ in the group, 200, 
was obtained by a Negro girl whose ancestry showed no evidence of 
white mixture (53, 54). 

TABLE 52 Degree of White Blood among Negro School Children 
with IQ's of 125 or Higher 


( \(lrii)tcd from Witty and JiMikiiis, 54, pp. ISO lOp) 


Degree of 
While Mix line 

Per Cent in 
General Negro 
Population 

Per Cent among Gifted Negro ( hildien | 

IQ 125 07 Higher 
(N - : 63 ) 

IQ 140 Of Higher 
(N - 28) 

No white ancestry 

28.3 

22.2 

21.4 

More Negro than white 31.7 

46.1 

42.8 

About equal 

25.2 

15.9 

21.4 

More while than Negro 14.8 

15.9 

14.3 


* less than of mother’s and of father’s ancestry reported to be white. 


Considerable caution should be exercised in generalizing from 
these findings, since the number of gifted Negro children included 
in such studies is quite small. Several attempts have been made to 
compute the percentage of Negro children falling within various seg- 
ments of the distribution of intelligence, and to compare the resulting 
figures with similar figures for white children. Such a comparison is 
complicated, on the one hand, by the fact that the samplings em- 
ployed may not be equally representative of the entire white and 
Negro populations, respectively, and on the other hand, by the fact 
that socio-economic and other environmental conditions are not 
equated in the two populations. It is not surprising, therefore, that 
estimates of the relative incidence of gifted children among Negroes 
and whites have varied so widely (cf., e.g., 19, 29). One fact which 
is clearly brought out, however, is that high intelligence is not pre- 
cluded by any degree of Negro blood. Individual cases cf highly gifted 
children can be found among Negroes of any degree of racial mixture 
or purity. 
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All in all, the available data on hybrid groups tend to uphold a 
cultural rather than a biological hypothesis of observed race dif- 
ferences in tested abilities. Among the previously cited findings in 
support of such a conclusion may be mentioned the following: 

(1) The correlation between degree of white mixture and intelli- 
gence test score among Indian children tends to decrease as 
amount of education increases. 

(2) The correlation between degree of white blood and test score 
among Indian children is highest for verbal tests and for tests 
depending upon information characteristic of the white cul- 
ture. It is lower on non-language tests and drops to virtually 
zero when both speed and language arc eliminated, as in the 
Goodenough test. 

(3) Among Negroes, the correlations between test scores and 
degree of while mixture have in general been much lower 
than among Indians. Corresponding to this finding is the fact 
that difi'erences in use of English and in assimilation of the 
white culture are much greater among Indians than among 
Negroes. 'These differences arc quite closely related to degree 
of white mixture among various Indian groups. Moreover, 
mixed-blood Negroes are more likely to be classed socially 
with the Negro race, regardless of the amount of white blood. 
In tlic case of Indians, the mixed-bloods are more likely to be 
classified in accordance with their proportion of white blood, 
rather than being hdiseriminately regarded as “Indian.'’ 'This 
would make foi more socially determined differentiation 
among Indians with different degrees of white blood than 
among Negroes with different degrees of white blood. 

(4) In Negro studies in which classifications of race mixture were 
based upon objective physical measures, the correlations be- 
tween test scores and degree of while blcx)d were much lower 
than when insnectional criteria were employed. Skin color 
likewise tended to give higher correlations than other criteria 
wliich vve-’-e equally good oi i>etter indices of white mixture. 
These findings suggest that it is not so much the actual amount 
of mixture as the general observable resemblance to the 
white race which w-as correlated with test pcrfoimance. Such 
a general rcseniolanec would play an important part in the 
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social contacts and everyday life opportunities of the mixed- 
bloods. In other words, it may have been the degree of social 
acceptance, rather than the amount of race mixture, that 
determined these correlations. 

J'J GIONAL riJT LRIiNCTS AND MIGRATION 

Another procedure which may contribute to an analysis of the role 
of cultura! and biological factors in group dilTercnccs is tiie compari- 
son of samplings of the same race living in dilTercnt regions. This 
is especially Iruilful when the regions present sharply dilfcrentiatcd 
environmental milieus. When feasible, iho direct study of migrating 
groups before and aber migration, or following different periods of 
residence in the new area, should permit a more clear-cut evaluation 
of contributing factors tlian is possible in static cornpaiisons of 
regional groups. 

It is a \veil-esi:tb!i.'>hed fact dial watliin any one racial group there 
are w'idc differences in tested abilities iroin one part of the I hilled 
wStales to another. This was first vividly demonstrated by a stale-by- 
statc eoinpilaiioii of the army test results in World Wlw 1 (I, 5b). 
Jn some stales, the median Alpha score of white enlisted men was 
as )oW' as 41, in others as high as 79 or 80. To be sure, such regional 
differeiiees may in part nhleet dilferenccs in testing pvihcy in the 
various areas, especially as regards the administration of Alplia or 
Beta (ef. Cli. Bui it is uniihely that this factor could account 
wholly or even in large part lor the dilfcrences obtained. Similarly, 
the proportion of foreign-born in each state could not have been a 
major factor, since most of the foreign-born were probably lc\ ted 
with Beta." A number of interesting correspondences were found 
between the rank -order of the states in Alpha medians and their rank- 
order in ceiiain environmental measures, such as socio-economic 
and educational indices. I'or example, the 4i states in wliich Alpha 
scores were a\'ailiiblc for at least 500 men were ranked for '‘educa- 
tional cnicieucy.’" The latter was based on such records as percentage 
of daily school attendance, percentage of children attending high 
school, per capita expenditure for education, and teacher salaries. Ihc 

-■ In f^Kl, a CH)ri elation of .61 was found between per cent of foi ''ig;n-born popnJa- 
tio’^ ant* Alpha meil’un ft*i eath sL.te. 'this cot tespoiuk nee piobably lesultcd spiiri- 
oaslv lioni the fact that the states with jelalivcly lar^c lorei^rn populations weic also 
the moie highly inflasiriahzcd and wealthier states, with beUer educational lucilities. 
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two sets of ranks, for Alpha score and for “educational clTicicncy,” 
correlated .72. 

Similar geographical dillerenccs have been found in the AGCT 
scores obtained in World War H (9, 50). Marked variations in test 
performance were noted among tiie nine Service Commands, or major 
areas into winch the country was divided in Seicetive Sen ice classifica- 
tions. in general, the southeastern and soiithv\estern slates had the 
highest rates ol rejection for intcl!ectual inadequacy, as wcil as the 
largest percentage of men m arm> grades IV and \' c^a liie AGCV 
(9J. Even when men in the sa/uc oci ui^aiioiis were compared, the 
samples irom nortiiern states generally obtained signiticaully higher 
AGCr scoies than tlios.e from souliiern slates (50) 

These regional dillercnces aie just as characteristic of Negroes as 
they are oi whites. In both World Wars, the army testing showed 
diilerences wliich were fu’lv as large among the f^iegio saiuplmgs 
from dilTercnt states or regions as among the white groups (9, 56). 
Moreover, the rank-ordei of the dilleienl areas wsis closely similar 
for both racial groups, Sucii results suggest duit Negro lest scores arc 
as responsive to tlic cn\ ironmenlal diiVerenees represented by the 
various regions as arc white test scores. Ol speciad interest in stmlics 
on the American Negro are comparisons between northern tuui .‘Oiilh- 
ern Negroes. Diifcrcnces betw'een such groups imdoubicdly result in 
part from the same socio-economic and educaiioriai dillerenccs which 


TABLE 53 Alpha and lU ta Medians oj Norlhcni ard Semthern 
Diajt and of Najivc-lhfn. lih lc Drad 

('/'il.ii'tul ironi 5«>, 1 * 


SanipliJii; 

, ■ ’infill 

Urta 1 

N 

TvleJjaii 

N 

Median 

White r.:itivc-horil 

5i,6?a) 

5S.9 

1 1,S79 

41.4 

Noilhcrn Negro 

2.K5() 

6 

1737 

32 5 

Soul hem Negro 


12.4 

3,438 

19.8 


account for the diilercnccs between whiles from northern and southern 
states. But they probably also reflect, to a certain extent, differences 
in the relative social position of the Negro in the North and the South. 

In Table 53 will be foimJ the Alpha and Beta medians of northern 
and southern Negro draftees in World War 1, together with the num- 
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ber of cases in the samplings from which these medians were de- 
rived. "the corresponding medians for the white native-born draft 
from the entire country are included for comparison. The Alpha and 
Beta results cannot be directly compared, because the scores are in 
different units and because the selective factors affecting Alpha and 
Beta samplings varied in different regions (cf. Ch. 20). The data do, 
however, serve to illustrate the large difference in average test per- 
formance between northern and southern Negroes. 

This regional difference htis been repeetcdly corroborated in studies 
on Negro school cliildren and college students. In the previously cited 
investigation by Peterson and Lanier (44), for example, large and 
highly signilicant dilfercnccs in test scores were found bciwcen the 
New York City and the Nashville gioups of 12-ycar-old Negro school 
children. In a more recent study by Roberts (48), the ACE scores 
of 253 Negro male college freshmen were analyzed. Comparisons 
were made m terms of parental occupation, veteran or non-veteran 
status, and northern oi southern ongm Lar<’er and more significant 
differcTices ni AC4 se( u‘ were toiind between comparable groups 
from the Noidi and tlu Somh than m an\ of the comparisons within 
a given geoiZKcphkai lejioi 1 he regainal JillereikC persisted when 
ccanpaiisons were made between gioups matched in occupational level 
of parents. 

The latter study illustrates a finding which has been repeatedly 
demoiistiatcd, viz., ioimal eJitcation and solio-cc oiunnic In el aie not 
cnoui^h to account for differences in the tested abilities of northern 
and soutlicrn Ncgiocs. When the amount of education is held con- 
stant, the regional diflercnces are reduced but by no means eliminated 
fcf , c g , Table 46, C h, 21), The same point can be made regarding 
Negio-Vvhile tli(lc»enccs in intelligence test scores. When only amount 
of education is held constant, part of the icmaining group diflerence 
mav be due to dissimilarities in the quality of education received by 
whiles and Negroes, or by northern and southern Negroes. 

There is some evidence, however, to indicate that even when Negro 
and white children in the same sch(»ols arc compared, intelligence 
test performance favois the whites. This was demonstrated in an 
investigation by Tanscr (51) on Canadian Negroes. Similarly, in a 
study conducted by Hrucc (7) in a pour rural distrLt in Virginia, 
the white children averaged higher than the Negroes on intelligence 
tests, even though an effort was made to control educational and 
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socio-economic variables. Superficially such findings appear to show 
a true “racial,” or biological, difference which persists even when in- 
equalities of education and socio-economic level are eliminated — and 
they have been ghen this interpretation by some writers. On the 
other hand, it should be noted that no study has satisfactorily con- 
trolled both socio-economic level and educational facilities in Negro- 
white comparisons. Thus in Tansers study, although the Negro and 
white children attended the same schools, there were signiiicant dif- 
ferences in s(Kio-cconomic level between the two groups. In the 
investigation by Bruce, educati ^n was equated only by choosing Negro 
and white schools which had the same teacher-pupil ratio, other 
probable diffciences between these schools remaining uncontrolled. 
In the same study, sub-groups of 49 Negro and 49 white children 
were matched in Sims socio-economic ratings; but the uithor ad- 
mitted that this sca^e was unsuiled to the groups studied because it 
docs not discriminate adequately at the lower socio-economic levels, 
where most of the subjects fell. 

The psychological environment, moreover, includes much more 
than formal schooling and socio-economic class. The many subtle 
emotional and motivational influences associated with minority group 
status and with Iraditiontil stereotypes still remain as uncontrolled 
factors in all these group comparisons. It is interesting to noie, for 
example, that in lanser’s study the white children attended school 
much more regularly than the Negro (51). Within the entire sam- 
pling of white children tested, school attendance averaged 93.38%; 
within the Negro group, averaged 84 . 77 %. Such factors as family 
traditions, social expectancy, and outlook for adult opportunities 
may all be reflected in these school attendance figures. Group dif- 
ferences which are the result of a Luge number of variables obviously 
cannot be wiped out b\ holding one or two variables constant. 

One of the frequcnti> cited results of the army testing in World 
War 1 was that the median Alpha scores of Negroes from Illinois, 
New York, Ohio, and Pennsylvania were somewhat higher than the 
median Alpha scores of whites from Arkansas. Georgia, Kentucky, 
and Mississippi. More than twenty i 'c years after these particular 
data were collected, they were resuscitated, re-analyzed, and made the 
subject of a scientific storm in a teakettle (cf., e.g., 2, 3, 5, 15, 16, 
17, 18, 19). Obviously these data were not meant to provide an 
adequate comparison of Negro and white pertormance, since the 
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samplings were hardly comparable. The groups were not even repre- 
sentative of all Negro or white recruits from the respective slates, 
since the proportion of men tested with Alpha and Bela differed from 
state to state and between Negroes and whites. Tho relative standing 
of these groups, of course, follows from the fact that there were large 
regional differences among both Negroes and whites. The data do 
provide a vivid illustration of the extent of overlapping between the 
white and Negro distributions. Not only could many individuals be 
found in the lower (Negro) distributiem who excelled individuals in 
the higher (white) distribution, but also local groups could be found 
in the lower-scoring population which excelled other local groups in 
the higher-scoring population. 

In explanation of regional differences in intelligence test perform- 
ance, two contrasting hypotheses have been proposed: one in terms 
of cnvironnicntal haudii ap, the other in terms of selective migration. 
The former attributes the regional differences to inequalities in home 
conditions, educational facilities, and other opportunities for advance- 
ment. 'fhe latter proposes lliat the more intelligent and progressive 
individuals, who have more initiative and arc better able to adjust to 
new surroundings, are more likely to migrate to the nv)re desirable 
areas. 11ie one hypothesis maintains that superior ability is a result 
of migration to a more favored area, the other that the migrating 
individuals were superior t(' begin with. Although the selective migra- 
tion hypothesis is commonly coupled with a hereditary interpretation 
of regional differences in ability, it should be noted that this does 
not necessarily follow, 'Thus it is likely that persons of superior edu- 
cational and socio-economic level are more often aware of the 
opportunities offered by migration to a better area. This would be 
true regardless of whether hereditary or environmental factors were 
initially responsible for the higher educational and socio-economic 
status of sueli individuals. These persons may in turn have more 
intelligent offspring, not necessarily because of better “genetic stock,” 
but because they provide their children with a more stimulating home 
environment. Logically, therefore, the selective migration hypothesis 
is equally consistent with a predominantly hereditary or a prei^omi- 
nainly environmental determination of individual differences. If, on 
the other hand, regional differences can be shown to have developed 
after migration, then they can be explained only in environmental 
terms. 
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A number of investigations have been specifically designed to test 
the selective migration hypothesis with reference to northern and 
southern Negroes (32, 3(S, 41), In one survey on several thousand 
Negro school children in Washington, D. C., Long (38) found signifi- 
cant differences in mean IQ in favor of Washington-born children. In 
separate comparisons made among first, third, and fifth grade chil- 
dren, the critical ratios of these mean IQ din'crences (diif oa . ) 
ranged from 3.48 to 6.71. In other words, the children who had 
migrated to Washington — in most cases from inferior southern com- 
munities — were not as intelligent as those born and reared in Wash- 
ington. In a more intensive study of a random sample of the migrant 
children, a significant positive correlation was found between length 
of residence in Washington and lO.-' 

The most ambitious effort to check the applicability of the hypoth- 
eses of selective migration and of environmental handicap to be 
found in the series of investigations by Klineberg and his students 
(32). The problem was appioached in two ways. First, the relative 
intellectual status of Negro children whose families had migrated to 
the North was investigated by comparing their former grades in 
southern Negro schools witli the norms for those schools. In this 
part of the study, the records of 562 Negro cliildrcn who had moved 
to the North from tlirce southern cities ^ were examined. Since all 
grades were transmuted into a percentile scale, a score of 50 repre- 
sents the average status, and this figure may be employed as a stand- 
ard of comparison. T he average percentile rating of those children 
wdio had moved to the N^ath proved to be 49.3, which is not signifi- 
cantly different from the general average. It is tlius apparent that, at 
least in these groups, there was m tendency for the initially superior 
children to migrate. 

A second approach to the problem involved the comparison of 
intelligence test scores obtained by groups of Negro school children 
who had lived in New' York Cit\' for different periods of time, dhe 
subjects were examined with a variety cl standard tests, including 
the Stanford-Binct, pciformance . '. 'es, and several common group 
tests. Over 3000 10- to J2-ycar-oid Negro children in the Harlem 
district of New York City were tested. The subjects in the different 

''^Thc correlation ratio (rfa) was comp Med, s.ncc the relation bclvsecn length of 
icsidc’ce and IQ was foanvl Sj be ciitvilmear. 

Nabhvdille, Tenn., tirninigham, Ala., and Charleston, S. C. 
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residence groups were equated for age and sex; they attended the 
same schools and were approximately equal in socio-economic back- 
ground, the only important difference between them being the num- 
ber of years spent in New York City. A group of Negro school chil- 
dren born in New York City was also included for comparison. 
Special checks were employed to demonstrate that the differences 
between the various residence groups could not be attributed to dif- 
ference in the proportion of white mixture, nor to a progressive 
decline in the quality of migrants coining to New York in suc- 
cessive years. 


TABLE 54 Relation between Length of Residence in New York City 
and Intelligence Test Scores of Negro School Children 

( Adapt t\i from KUneberg, 3J) 


National IntclU^vncc 

7 est 

Sianjord-Binct 
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Years of 
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IQ 

1-2 

150 

72 

Less than 1 

42 

81.4 

3-4 

125 

76 

1-2 

40 » 

84.2 

5-6 

136 

81 

2-3 

40 

84.5 

7-8 

112 

90 

3-4 

46 

85.5* 

Over 8 
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94 

C)ver 4 

47 

87.4 

Northern-born 

1017 

92 

New York-born 

99 

87.3 

Minnesota Paper harm 

Hoard j Pintner- 
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Score 

Residence 

of Cases 

1-2 

27 

39 00 

Less than 2 

20 

142.5 

3-4 

25 

26.67 

2 5 

20 

139.8 

5-6 

30 

31.88 

Over 5 

20 

152.1 

7-8 

23 

37,50 




9-10 

25 

37.50 

Northern-born 

50 

164.5 

Over 10 

41 

37.50 




New York-born 

223 

41.61 





* ThiS figure is misprinted as S8.5 in the Klinebcrg monograph (32, p. 46). 


In Table 54 will be found the mean scores of each residence group 
on the National Intelligence Test, the Stanford-Binet, the Minnesota 
Paper Form Board, and an abbreviated form of the Pintner-Paterson 
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Performance Scale In both the National Intelligence Test and the 
Stanford-Binet, there is a progressive rise in average score with increas- 
ing length of residence in New York City 1 he means and SD’s obtained 
with the National Intelligence Test, which was gi\cn to the largest 
number ol cases, are shown graphically in Figure 94 It is interesling 



Y 1 r^:>idii(i in Njrthcrn Ci*y ^ 

1 ijl. 94. Mean incl SD Oi Nilionil Intcllmcncc 1 Scojls of Negro 
Children ip RlIiUoii to 1 c (h of f siJciiue in a Noilhun ( rty (Dila 
from Klmtluig 32 pp 2(> o ) 

to note that the eroiips bom in the Noiih or in New York Chty are not 
superior to those who WLie born in the South but had lived in New 
Yoik foi a loiv’ period I ‘us la J done v iih the consistent inciease m 
score with Icneth 4 Ntw Yoik C it\ lesict lec, tends to suppor* the 
environmental rather th in the selee i migr ition hypothesis Although 
overlap is huge and dideiciKcs between adiacent groups aic too small 
to be statistically signilie the moie extreme dillerences in National 
Inlelhgenec lest or Staiilord-BiiK t means arc significant at a high 
level of conlidcncL \Vc should hardly expect dilTercnccs ol only one 
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or two years in length of residence to affect test performance. Longer 
periods in a more favorable environment are, however, clearly re- 
flected in these rising means. 

The contrast between the trend observed in the verbal tests, on the 
one hand, and in the non-verbal tests, on the other, is also of interest 
in this connection. The rise in lest score with increasing length of 
New York Lily residence is more consistent in the case of the National 
Intelligence Test and the Stanford-Binct than in the case of the 
Pintner-Paterson and the Minnesota Paper Form Board. This is to 
be expected, since the two verbal tests are much more highly de- 
pendent upon the type of information which w^ould favor children 
reared in the urban New York environment. With regard to the two 
non-verbal tests, the Pintner-Paterson and the Minnesota Paner Form 
Board, the latter is even less dependent upon information (T a specific 
cultural nature. And tlie results did show a more marked trend in 
the Pintner-Paterson Ilian in the Minnesota Paper Form Board. All 
dillerences on the Minnesota test w'ere insignificant, while a few of 
those found on the Pintner-Paterson approached the commonly ac- 
cepted levels of significance. Thus, on the whole, the available evi- 
dence concerning regional dilTerences among Negro sclmol children 
favors the environmental hypothesis nioie strongly llnin it does the 
selective migration hypothesis. 

CROSS-COMPARISONS \MONC^. RACIAL 

AND NAIIONM C.ROIjPS 

Lross-comparisons among individuals classified into racial and into 
national categories have been made with immigrant groups in this 
country, as well as with the paient populations in luirope. 'The first 
extensive effort to compare the intelligence test scores c^f pAiropean 
immigrant groups in America was Brigham's analysis of the Army 
data obtafined during World War 1 (6). Brigham computed the mean 
combined scale scores for 12,492 foreign-born draftees, classified 
according to country of birth. The resulting hierarchy of national 
groups, however, was of little significance in itself because of the 
uncontrolled operation of many of the factors di^?cussed in Chapters 

■' The “combined ^cak" was a means of tiansmulinLi; scoics on Alpha, Itcla, aiiii 
individual tests into compaiablc units, in older to permit the direct comparison of 
individuals who iiad taken diffeicnt tests. 
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20 and 21. For example, the two highest means were obtained by 
groups from English-speaking countries. Similarly, a consistent tend- 
ency was iound for the mean combined scale score to rise with in- 
creasing length of residence in America, regardless of nationality. 

Brigham further undertook to compare Nordic, Alpine, and Medi- 
terranean sub-groups within the same sampling of foreign-born sol- 
diers. For this purpose, he employed rough, available estimate' of 
the proportion of Nordic, Alpine, and Mediterranean elements in each 
country. France, for example, was estimated as 30% Nordic, 55% 
Alpine, and 15% Mediterranean; Sweden, as 100% Nordic; Rou- 
mania, 100% Alpine; Germany, 40% Nordic and 60^^ Alpine. The 
distributions ol intelligence test scores lor each national group were 
then cut according to these proportions and recombined into Nordic, 
Alpine, and Mediterranean distributions. Fo^ example, all the Swedish 
scores were classified under Nordic, all the Roumanian imdci Alpine. 
In those cases in which more than one racial group was represented 
within a single nation, the average score of that national group was 
allotted {')r()portionatcly to each racial group, d'hus 40% of the Ger- 
man sampling was entered under Nordic and 60% under Alpine. 
Since there was no way of determining in which portion of the na- 
tional distribution of scores the Alpine and Nordic individuals fell, 
ail subjects were given the average score of their icspeetivc national 
group. By this method, Brigham found the Nordics to have a signifi- 
cantly higher mean score than the Alpines, and the Alpines to have a 
significantly higher mean than the Mediterraneans, 

It is apparent that tin,, procedure involves a logical fallacy in so 
far as it assumes tlie absence of dilferences in score between racial 
groups wathin a single nationality, and at the same time it undertakes 
to prove the existence of just suci» a difference among racial groups. 
wSince no differentiation was made among individual members of dif- 
ferent races within any single national group, the subjects being 
chosen indiscriminately from the entire distribution of national 
scores, nothing was really gained by the rcclassitication into Nordic, 
Alpine, and Mediterranean. More '\er, the same uncontrolled factors 
which rendered the comparist)n of national averages invalid in this 
study also operated in the comparison of the three racial groups. For 
example, the comparison of English-speaking and non-English-spcak- 
ing Nordic groups revealed a significant difference in favor of the 
former. 
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An attempt to classify individuals more empirically into racial 
groups was made by Hirsch (25) in an investigation of children of 
immigrants in the United States. The main group of subjects con- 
sisted of 4983 Massachusetts public school children ranging in age 
from 5V2 to 18 and in school grade from the first to the ninth. In 
social and occupational level the group was quite homogeneous, all 
the subjects living in small manufacturing communities. There was no 
segregation of national groups into districts and all the children 
attended the same schools. Group intelligence tests were chosen which 
relied somewhat less on language and on speed than is usually the 
case, although these factors were by no means eliminated.^’ The chil- 
dren were first classified into national groups on the basis of parents’ 
birthplace. An ‘‘American” group of native parentage was also in- 
cluded for comparative purposes. 


TABLE 55 

Parentdi'c 


Mean IQ's of American School Children of Foreign 

• 1' I (.nt llu sO), ]’ J'<7 ) 


Nauoiicilitv 

Swuher 

Oj i tOi’.V 

Miw? JQ 

SI) 

Polish Jews 

75 

102.8 

14.55 

Swedish 

232 

102.1 

15.48 

f’nglish 

213 

100.7 

14.85 

Russian Jews 

()27 

99.5 

14.58 

Germans 

190 

98 5 

15 09 

Americans 

1031) 

98.3 

15 87 

IJlhiianians 

468 

97 4 

13 89 

Irish 

214 

95.9 

16.08 

Bnlish Canadians 

135 

9T8 

14 67 

Russians 

90 

^^0 9 

12 93 

Poles 

227 

89.6 

12 96 

Gi'ccks 

270 

87.8 

15 12 

Italians 

350 

85.8 

11 94 

French Canadians 

243 

8^3 

14.35 

Porliiguese 

671 

82.7 

13.47 


The results of this analysis arc shown in Table 55. Most of the 
differences between the average IQ’s of these national groups were 

® The folJowing tests wcic administered in different grade levels: 
Pjntner-Ciinninghani Primary Scale — first grade 
Dearborn Test A — second and third grades 
Dearborn Test C — fourth grade upwards 
To reduce the role of speed, all tests were given with a longer time limit than is 
specified in the standardized directions. 
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statistically significant. The same rank-order of nationalities was ob- 
tained when the groups were compared in the percentage of “very 
superior intelligence’’ and of “borderline deficiency.*” The relative 
status of the national groups also agreed in general with that reported 
in previous investigations on children of foreign parentage. Taking the 
national groups as a whole, Hirsch found no evidence for the con- 
sistent superiority of any one racial group. Thus among the eight 
highest entries in Tabic 55 are to be found two predominantly Nordic 
groups (English and Swedish), two which are largely Alpine (Ger- 
mans and Lithuanians), one predominantly Mediterranean (Irish), 
and three composite or mixed groups (Americans, and Polish and 
Russian Jews). 

In order to arrive at a somewhat more accurate determination of 
“race,” Hirsch classified each individual into a racial type on the basis 
of eye and hair color. All subjects, irrespective of their national de- 
scent, were divided into three major categories: the ''blond type'" 
with light hair and blue, gray, or hazel eyes; the "brunette type*' with 
black hair and gray, hazel, brown, or black eyes; and the "mixed 
type'* exhibiting all other combinations of hair and eye color. The 
blond type was taken to correspond roughly to the Nordic and the 
brunette to the Mcditerianean race. The m'xcd type would of course 
include Alpines as well as mixtures of any of the three racial stocks. 
This method of classification is, to be sure, crude. Hair and eye color 
are not generally considered to be very valid criteria of race. The 
analysis is, however, suggestive as a first attempt in the direct classi- 
fication of individuals int^ racial categories. 

The results of this analysis likewise lent no support to a racial 
interpretation of group dilTereiices in intelligence test scores. No one 
of the three physical types was (onJsrently superior or inferior within 
all national groups. Ihus among the representatives of one nation 
the blonds stood first- among those of another nation the brunettes 
led. The differences between physical types, furthermore, were much 
smaller than those within a single type. The differences in IQ between 
any two types within a single nation rangea from 0.1 (between blond 
and mixcd-t\pc French Canadian. ; to 6.7 (between brunette and 
mixed-type Poles). The differences between the lowest and highest 
national averages within any one physical type, on the other hand, 
were all considerably larger. Thus the mean difference in 10 between 
the highest and lowc..t blond groups was 14.8; between the highest 
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and lowest brunette groups, 18.1; and between the highest and lowest 
mixed groups, 21.3. These cross-comparisons between national and 
physical or “racial” categories thus suggest that the obtained dif- 
ferences arc more closely linked with national than with racial 
background. 

The chief weaknesses in Hirsch’s study are: (1) the likelihood 
that the samplings tested were not representative of their national 
populations because of selective factors in immigration; and (2) the 
use of very crude criteria for racial classification. Both of these limi- 
tations were avoided in a study conducted by Klincberg (31) in 
Europe, in which an attempt was made to obtain as pure samples of 
Nordics, Alpines, and Mediterraneans as possible. The subjects were 
700 10- to 12-year-old school boys in rural sections of France, Ger- 
many, and Italy.' The samples were taken from those geographical 
areas in which ethnic maps showed a predominance of pure strains 
of each of these three racial groups. Only children who had them- 
selves been born in the particular area, and both of whose parents 
had likewise been born in the ^ame area, were included in the study. 
The subjects were further selected on the basis of three physical 


TABLE 56 Cojupdrison of National and Racial Groiip:!^ on a 
Performance Scale 

( I « n i)t I f, .M . ]t 2 "' } 


(lump 

Pi(> \ nice 

Nunihi'f of 

Pi‘)!o 

rnuinve Scuh 

' Si ore 




Mean 

Median 

Range 

1. Gci Titan Nordic 

Hanover 

\1 

198.2 

197.6 

69-289 

2. t rench Medi- 

Eastern 





tei rancan 

Pyrenees 

J2 

J97.4 

204.4 

71-271 

3. German Alpine 

Haden 

to 

193.6 

J99.0 

80-211 

4. Italian Alpine 

Piedmont 

IQ 

J88.8 

186.3 

69-306 

5. Flench Alpine 

Auvergne 






and Velay 

19 

180.2 

185.3 

72 296 

6. French Nordic 

7. Italian Medi- 

Flandcj s 

13 

178.8 

183.3 

63-314 

terranean 

Sicily 

9 

173.0 

172.7 

69-308 


Rural groups were chosen since too piuch intermixture had occurred in urban 
districts to yield a suHicient number of “pmt-* types” Ihiec city gioups, m Hamburg, 
Paris, and Rome, wcie also tested for ctmiparativc purposes, ['he results of this testing 
will be reported in the following chapiei. 
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criteria: eye color, hair color, and cephalic index. No subject was 
retained unless he fell within the specified limits for his racial group 
in all three criteria. The groups were comparable in socio-economic 
and occupational levels, the dilTcrcnccs among them in these respects 
being relatively slight. 

Hach subject was examined individually with an abbreviated form 
of the Pintner-Paterson Periormance vSeale, consisting of six tc^ts.^' 
Brief oral instructions were given in the subject’s native language. 
Performance was scored in terms of speed as well as accuracy. In 
d able 56 will be found the mean, median, and range of scores within 
each group. The geographical location of the group and the number 
of villages covered are also given. The number of cases is exactly 
100 in each ol the seven groups. 

The scores show marked variations among dilTcrent samples of the 
same racial group. 1 lie alleged Nordic-Alpme-Mediterranean hier- 
archy is not maintained. Although the highest mean score is obtained 
b\ a Nordic group, the highest median is found in a Mediterranean 
group. Similar 1>, the rank -order of the racial groups within any one. 
nation is inconsistent. Itiiis in France the Mediterranean group is 
best, the Alpine intermediate, and the Nordic poorest; whereas in 
Ciermany the Nordic is superior to the Alpine sampling, and in Italy 
the Alpine is superior to the Mediterranean. The marked ovctlappino 
of groups, as indicated by the lange, should also be noted. When all 
Nordics. Alpines, and Mediterraneans are compared, regardless of 
nationality, the following mean scores are obtained: 

TnotUic 18 S^ 

Alpine 18"^ 

Med 'ten inca’ 185 2 

None of tliese dilTerences is statistically significant. The variations from 
one Nordic sample to anodiei, on the oilier hand, arc large and signifi- 
cant. The same is true of the other two lacial groups. Thus there is 
a dilTcrence of 24.4 points between French and Italian Mediterra- 
neans; one of 19.4 points between Cicrman and French Nordics; and 
one of 13.4 between (ierman and 1 icnch Alpines. 

The tests employed in this investigation are, of course, quite limited 
in the type of function which they measure. Moreover, within the age 

^Thc Kiu>\ C'ubc, and Iho Tnan^’lc, Healy A, T vvo-Figuit*, rivefigiut*. and 
Casuist loim boards 
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range covered by the study, individual differences in score may reflect 
largely differences in speed of work, since the tasks are relatively 
easy for older children and adolescents. A repetition of this study 
with improved measuring instruments made possible by current de- 
velopments in psychological testing would be a valuable addition to 
our understanding of group differences. 

Within the behavior sampled by the tests which were employed, 
Klineberg’s results did clearly demonstrate that the obtained dif- 
ferences among national groups could not be attributed to the “racial” 
composition or to the proportion of Nordics, Alpines, and Mediter- 
raneans in each country. Because of the variations found among dif- 
ferent samples of the same nation, Klincberg proposed that the 
differences may not even be national m scope, but should be en- 
visaged in terms of smaller cultural units. That the differences are 
the result of environmental rather than hereditary factors is suggested 
by two considerations. In the first place, the predominance of a single 
inbred family str.im in any one of the samplings tested is very un- 
likely because of the wide area covered. It will be recalled that from 
9 to 19 villages were Ciinvassed for each single sampling. In the 
second place, vers interesting parallelisms w^cre found ♦between the 
cultural, economic, and cdueational conditions in any one region 
and the intelligence test performance of its inhabitants. 

Although not concerned with national groupings, a re-analysis by 
Mann (40) ot data collected by Poiteus (45) may be included at 
this point, since this analysis is likewise based upon cioss-compansons 
among the same individuals classified with respect to biological and 
cultural criteria. Following a scries of investigations on seveial native 
groups in Australia and South Africa, Porteus had concluded that the 
Australian aboriginals were racially superior to the Africans in the 
functions measured by the Porteus Ma/e Tests. He argued against 
an environmental explanation ot the (obtained differences, on the 
grounds that the environment of the Austr<diim gioups w^as actually 
more '‘repressive” than that of (he Africans I’his assertion he based 
principally on the greater scarcity of food and water in the habitat 
of the Australian groups. In itself, such an inierence is questionable, 
since several of the African groups studied had to “contend not only 
with some of the most dangerous wild animals on earth but also with 
some of the fiercest native tribes,” while the Australians were un- 
molested bv either. 
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TABLE 57 Par tens Maze Test Performance by Native Peoples of 
Australia and South Africa 

in'iii Poiit'us, -IS. p 2*17, J'lifl ^Liiui, 40, p 


Tribal (i roups Classifuul 

Ac coni in i' to Rai c 

Tnhal (i roups Classified 

Ac r(>/dtni’ !<} Si hciohn.v Tacilitics 

Race N M A oa 

Ma/e Tests 

School inj^ ^ M c'lu M A < 

i'ucililics Ma/cTc'.ts 

Australian 128 10.89 

African 207 10.27 

diff./n,,,ff -- 1.8 

Mission or 

government 208 1 1 .20 

None 127 9.27 

difT. a.mr '"6.0 

1 


Tn coinpiuing ihc rr,,,,, , Mann used an approx. mation of the Sl'i’s owinj; to 
the fuel that he did not ha\c atci'S'' to the otipinal scoics Since ihe conliasi between 
the two tvpes of Lomp. 'isi>n is so smkinp, howevi'i, it is imlikelv th.ii the ii'inpuiu 
lioji of the picci.se SU’s in each case would have vdl'ccted the conclusion. 


Porlciis' data v\crc subsequently reclassified by Mann with respect 
to schooling opporlunilics. Some of the tribes tested had had access to 
mission or government schools, while others had not. When these 
tribes were grouped, lirst, in terms of rac'ial category (African or 
Australian), and secondly, on the basis of schooling facilities, the 
results shown in Table 57 were obtained. It is apparent that the 
former classification yields a small and rather insignificant differenee, 
while the latter gives a much larger dhference wliich is significant at 
a high level of confidenci". Test scores in this study were thu.s more 
closely related to scliooiing than to racial category. 

RACIAL versus cui factors in the 

DHVIiLOPMrNT OF FFR.SONALFrV 

Popular opinion has consistently attributed characteristic tempera- 
mental qualities to each race or nalioriahty Gnmp differences in per- 
sonality are held to be even greater than in ability, and the belief in 
such emotional diflcrcnces persists . when intellectual equaiit> is 
granted. Such familiar stereotypes as the Irish wit, the excitability of 
the South European or ^“Latin” groups, the easy-going nature of the 
American Negro, the stolidity of the American Indian, the composure 
of the Englishman, and a host of similar characterizations have become 
a part of our daily vocabulary. 
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In a number of investigations, paper-and-pencil personality tests 
have been administered to samplings of various groups living in this 
country, including European immigrants, Negroes, American Indians, 
and Orientals. Some investigators report no sigriificanl diiferences 
among the racial or national groups compared. Others have found 
slight differences, usually in liie direction expected from tradition and 
popular belief. On the whole, the resuks of these studies arc very diki- 
cult to interpret, partly because of the dubious validity of many of the 
tests, and partly because of the unrepresentative natuic of some of the 
samplings employed. The comparison of Negro and white college stu- 
dents, for example, would be subject to a difieiential operation of 
selective factors in the two groups (cf. Ch. 20). 

Moreover, on personality tests, even more than on tests of intel- 
lectual functions, a given lest item may have a dilfercnl meaning tor 
Negroes and whites — or Tir any groups with very dissimilar experi- 
ential backgrounds. Even if all specific terms in the item are inter- 
preted in an identical manner and with relerence to the same standard 
by Negroes and whites, the same response may have a dilferent diag- 
nostic or prc^gnostic sign’lieance when given by a Negiro and by a 
white subject. Tims the statement that one is being discriminated 
against by many oi his associates migln indicate undue suspiciousness 
or even paranoid tendencies in a white respondent, but it might indi- 
cate only a realistic awareness of social attitudes in a Negro re- 
spondent. 

An investigatiem conducted by Klincberg. E^ield, aud E’oley (34) is 
of special interest, since it rcpiesents another application ol tlic tech- 
nique of cr(}Ss-coi}iparisi)ns among cultural and biologicml groupings. 
Over 400 male and female studenis attending eight diifcrent institu- 
tions of collegiate rank in New York C'lly and its environs were exam- 
ined with a series of personality tests. The tests included the 
Bernreuter Personality Inventory, the Allporl- Vernon Study of 
Values, an honesty test (Mailer Jest of Sports and Hobbies), and 
two tests specially devised for use in this investigation, one to measure 
suggestibility and the other persistence.'' \ he subjects were classified 
into Nordic, Alpine, and Mediterranean groups on the basis of 
cephalic index, eye color, hair color, and skin color. 

The mean scores of Nordic, Alpine, and Mediterranean groups 
on each test are given in Table 58, the data for the two sexes 

F’or a fuller discussion of these tests, the reader is rcleiicd to Chapter 13, in 
which another pait ot the same investigation was lepoited. 
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being reported separately. On the Allport- Vernon Study of Values, 
only one of the differences is significant in each of the sex groups. 
Among the women there is a significant difference between Mediter- 
raneans and Alpines in the mean score for “aesthetic value.” This 
difference is 3.26 times as large as its standard error, the higher mean 
occurring in the Mediterranean group. Among the male students a 
significant difference was found between Nordic and Alpine groups in 
the mean score for “religious value.” This difference was in favor of 
the Nordics, the critical ratio being 4.64 Upon further analysis, both 
of these differences seemed to be rather closely linked with institu- 
tional groupings. Thus the highest mean score for “religious value” 
was obtained in a Catholic college for men in which were found only 
three Alpines. This would tend to pull down the mean of the Alpines 
in relation to those of the other two racial groups. Similarly, among 
the female subjects, the highest scores in “ivsthctic value” were ob- 
tained in an institution which encourages the aesthetic attitude, as is 
evidenced by a large and popular art department. This institution 
furnished a relatively large number of Mediterraneans, thus raising 
the mean “testhetic value” score of the latter group. None of the other 
Ailport- Vernon scores yielded significant differences between racial 
groups. 

None of the differences in the six Bernreuter scores proved to be 
statistically significant in cither male or female group. Likewise, in the 
three remaining tests, i.e.. suggestibility, honesty, and persistence, no 
significant group differences were found. 

Jt is apparent that in the personality traits measured in this study 
the differences among Nordics, Alpines, and Mediterraneans within 
college samplings are very slight. Nor can it be argued that the lack 
of differentiation among these groups was due to the homogeneity 
of college students in the characteristics under investigation. Although 
relatively homogeneous in intellectual traits, college students exhibit 
large individual differences in personality development. This is borne 
out by the very wide ranges and SD’s found within each group. It may 
also be mentioned that, as a result of the wide range covered by each 
group, a large and almost complete overlapping of the distributions 
of Nordics, Alpines, and Mediterraneans was obtained on each test. 

In sharp contrast to the predominantly small and insignificant dif- 
ferences found between racial groups, many large and significant differ- 
ences in mean score were obtained among the academic institutions 
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covered by this investigation. Several of these differences were many 
times larger than would be required to meet the usual standards of 
statistical significance. In both male and female samplings, the Allport- 
Vernon scores showed the largest differences. These differences agreed 
closely with well-known characteristics of the institutions under con- 
sideration. Ihus in ‘‘religious value,” the one Catholic college in the 
group obtained the highest mean score; the lowest mean was found 
in an institution whose student body was traditionally radical, agnos- 
tic, and of relatively low socio-economic level. The difference between 
these two means was 14.49 time^^ as large as its standard error. It is 
interesting to ncHc that another very large difference was obtained 
between the same two institutional groups in “theoretical value.” In 
this case, however, the difference was in favor of the lattci group, the 
critical ratio being "^18. On the Bernreutcr scales the differences were 
not so marked, although many were statistically significant. The tests 
of suggestibility, honesty, and persistence yielded relatively small and 
insignificant differences. 

Whatever the cause of these institutional differences, it cannot be 
“race” in the biological sense, since the differences disappear when 
individuals arc classified according to the physical criteria of race. 
The explanation of these personality differences from one institution 
to another is not difficult to find. In the first place, selection obviously 
operates in the students’ enrollment in any particular institution. Indi- 
viduals with certain altitudes and emotu>nal characteristics will be 
more readily attracted to osc institutions which are by tradition con- 
genial to such traits. The cwidcncc indicates, however, that such selec- 
tion operates on the basis of the economic and cultural group in 
which the individual was rcaicd radicr than in terms of race. In the 
second place, attendant e in a particular institution will itself foster the 
development of certain personality traits through the resulting social 
contacts and other direct stimulating circumstances. 

In recent years, much has been said and written about ‘^national 
character:^ The broad geographical scope of World War II brought 
about a sudden realization of the nec I for more knowledge regarding 
the customs, attitudes, and other psychological characteristics of many 
different cultures, including both allies and enemy nations. If we 
clearly recognize that “national character” is a cultural rather than a 
racial concept, the study of such national differences will not only 
yield results of practical value, but may also contribute to a better 
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understanding of the nature and causes of such group differences. 
Many of the techniques available for suoh comparative studies of dif' 
ferent cultures, as well as some of the pitfalls to avoid, have been 
summarized by Klineberg (33), 

Some psychologists and anthropologists consider the Rorschach test 
to be a promising instrument in this lieki of research, because of its 
relative independence of language and other culturally restricted con- 
tent (23). Preliminary results have been reported on a number of 
American Indian groups and other cultures in which questionnaire 
methods would be quite unsuitable. The Rorschach test was also in- 
eluded by C, DuHois (10) in her intensive field study of the people of 
Alor, an island in the Netherlands Last Indies. It must be remembered 
that the validity of many of the proposed diagnostic interpretations of 
specific Rorschach responses has not \el been satisfactorily estab- 
lished, even within any particular segment of our own culture. To 
what extent the various response chaiactcristics may have the same 
significance in diflercnt cultures is also a matter that requires further 
study. 

Among other available techniques cited by Klineberg (^3 ) are those 
utilizing “laboratory" or performance-type tests, in which the sub- 
ject's response to such situations as failure or frustralion is observed. 
Considerable care musi be '.xercised in generalizing from such a test, 
however, since the specific tasks may vary in importance for individ- 
uals of different cultures and thus nunivation may not be comparable.. 
Another possible source of data is to be found in the many descriptive 
accounts of “national character” which have appeared, including both 
the more journalistic, popular reports and the more technical surveys 
by anthropologists and sociologists."’ The analysis of cultural products 
— such as humor, drama, moving pictures, literature, and popular 
songs — has also been a favorite ap[)roach. Even the examination of 

Hoiicdicl <4), Cioicr i22), Lyiul (3^)), Me:ul (42) I'oi <.)ilitT refer- 
ences. cf. Cailin (21 ). 

In 1948, an extensive piojcct on national tharactet was begun by a group of 
Colunibia l/'iiucisity anilii opokigists. under the auspices ol the Psychological Mranch 
of the Medical Sciences I^ivision of Ibe OHicc of Naval Research (cl. 36). This 
project, which was onginally under the diitclion of Dr. Rulli Benedict, involves the 
appl.cation t)f anthiopological methods to the study ol a number ol conterapoiaiy 
literate cultures. Studies on immigiant groups in New York City arc being supple- 
mented by field stiidie.s in the counlncs concerned. P,^ychologIcal tests, intei views 
with representatives of diflercnt groups, and the analysis ol cultural producks arc 
among the techniques being employed. 
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existing stereotypes may be helpful, if the stereotypes are recognized 
as such and arc used only as leads for further analyses. Certain vital 
statistics, such as the Irequency of psychoses or crimes of various 
types, may likewise provide useful data if supplemented by other 
information. 1^'or example, the relative frequency of homicide in 
defense of family honor in one culture, or of suicide in defense of 
individual honor in another, may furnish fruitful clues to the une’er- 
standing of other characteristic behavior. 

A method developed principally h\ anthropologists for use in rela- 
tively simple cultures, but subsequently extended to the analysis of 
national dilferences, is based upon a study of the child-training prac- 
iices followed by dilTerent peoples (cf., e.g., 21, 22, 37). Feeding 
schedules, methods of toilet training, disciplinary techniques, and 
other child-rearing procedures arc compared among diiferent cul- 
tures. Some investigators have ch'iimed that the characteristic adult 
attitudes in any one culture may depend in part u])on the degree to 
which such childhood experiences were characterized by austerity, 
rigidity, aloofness, informality, emotional warmth, and the like. The 
available evidence for such claims, however, is exlreinely meager and 
of dubious significance (cf. 43). 

All these methods represent highly tentative approaches to the 
study of personality dilTcrcnccs among cultural groups. Many are 
rather subjective and likely to reflect what the investigator expected 
to find. Another point to consider is tliat cultures arc not hcanogenous. 
This is particularly true of modern nations, which represent a varied 
array of local “regional C ’raciers” in difTcrent sub-groups. As condi- 
tions change, moreover, “national character'’ may change. The de- 
scriptions cannot be expected remain fixed, although certain 
features may persist. 

Cil STI RM AN EXAMPLi: OF CULTURAL ASSIMILATION 

It has often been maintained that racial groups manifest characteristic 
bodily attitudes and movements. The habitual postures, peculiar walk, 
and other traditional motor habits .n various groups have been de- 
scribed at great length. Attention has also been called to the large 
group differences in the speed and tempo of movement. Special inter- 
est, however, has always been aitaehed to the gestwal behavior of 
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different peoples. The frequent emotional connotations of gestures, 
their peculiar relationship to language, and the easily observable 
differences in the traditional gesture patterns of various groups have 
made their study a particularly fascinating one. A voluminous litera- 
ture has accumulated on this subject most of the writings being either 
purely descriptive or speculative in nature. Artists, historians, philoso- 
phers, anthropologists, and many others have contributed their obser- 
vations or theories to this topic (cf., eg.. 1 1, 35). The layman, depend- 
ing upon his mood and disposition, is amused, estranged, or repelled 
by the spectacle of a gestural pattern too unlike his own. In popular 
thought, gesture has been linked with underlying personality differ- 
ences among racial groups. As a result this phase of motor behavior 
has acquired a special significance in discussions of race differences. 

A suggestive approach to the study of characteristic 'TaciaT’ ges- 
tures is to be found in an investigation by Ffron and Folev (11, 12). 
The groups employed w'crc: ( 1 ) 'traditional’' Italians living in "Little 
Italy,” one of the Italian districts in New York City; (2 ) "traditional” 
Jews living in New York’^ lower Fast vSidc; and (3) "assimilated” 
Italians and Jews, both living in similar "Amcricani/ed" environ- 
ments. In view of the wide diveisification in bcha\ior paTterns among 
different samplings of Italian and Jewish subjects, ffic authors further 
specify that the Jews included in this investigation were predominantly 
ol Lithuanian or Polish exJ' action, and the Italiiins were from south- 
ern Italy, chieffy fiorn the vicinity of Naples and from Sicily. The 
findings arc thus icsirictcd to these particular gioujis. Similarly, the 
results arc to be qualified by the fact that only immigrant groups in 
America were employed. 

The gestural behavior of these subjects was investigated by the fol- 
lowing methods: ( I ) direct observation and description, (2) sketches 
made by an artist, and (3) motion pictures. All three methods were 
applied to gesticulation occurring in everyday litc situations, the sub- 
jects being unaware of tlie fact that they were being observed. The 
motion picture matciial w'as subjected to two types of analysis. In the 
first place, the films were shown to naive observers w'ho were asked to 
judge various chaiaclcnstics of the movements. The second method 
was more quantitative. T he film, taken with a constant-speed moving 
picture camera, was projected frame by frame upon coc^rdinate paper. 
The positions of motile parts, such as fingers, wrist, elbows, etc., were 
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marked in successive frame projections When these points were 
joined, a precise representation of the gestural behavior pattern was 
obtained bicuic 95 illustrates this graphic technique in the case of a 
traditional Italian. It will be noted that there arc tour distinct lines of 


motion portraved, the continuous 
lines representing the paths of 
movements of the right and left 
wrists, and the bioken lines depict- 
ing the accompanving motion^ ol 
the respective elbows The numbers 
indicate the direction ol movement 
representing Hk position ot the 
gi\cn pirt in each sueccssoc l»amc 
projection 

A study of the cur\es con 
structed by this technique as wch 
as a considei ition ot the d ila col- 
lected b> the odier mc'^rc <ju ihl i- 
live methoeb, led to two pMn( ipJ 
conelusioP'' Tii>t, ilea^K dislin 
gMishablc tind eharacteiistic ges- 
Uiial patterns were e diibited b} 
the traditional Itihm ana h 
groups Some ot the major J 1- 
fciences between these ^ itt( rn 



rig. 95. Cifup lie leehiiiqiic Em- 
ploved in the Ai.c iv is ol Cjestural 
Be hi VI )i (I jom tlKMi tind ?olev, 
12, r J 4) 

1 ^ be summanzect as follows: 


An aiialNMS oi the p ns >/ the Ifoch invoK d m acstieiilatins levealcd 
that the It ill in te ids to use prefer h^y his iims when, is the lew tie- 
qiicntlv emploNs hn he id is ^ ell i'* his aims h in E and fineei^, in a 
functionally dillereiUi ii Iw \ lie id and lingei gestur^^^ a* e lather Upi^-al 
ol thu lewish espiessvc laoM. ments 

The fofni oi // ^ / iOMnunt dso showed t i i iihed contr isl henween the 
two groups Jn the Jew the mosenients au olnm ^inuous anei change di 
leetion iicquentlv the Iiihin is mire iiulinv.J to continue in the same 
diicetion until eompletiem ol the cn^ ‘Gesture se ^ment 

In icgard to latnahty (i e unihterai or bilalcril) as well as 
of movement, pionounecd d’herenecs wcie noted File Jewish gestiiic is 
predominanllv asvmmetiical, with lieaiient crossings and interim inings 
Gestieulauon is usualh c\ceuted witl one hand and <irm, and if two are 
used they aro employed in a sequcntiil ratliei than a simultaneous fashion. 
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The Italian, on the other hand, frequently uses two arms simultaneously, 
and the movements arc highly symmetrical in character. 

The radius of the movement differed in the two groups, the Jew em- 
ploying a relatively confined area, while the Italian sweep was loiind to be 
characteristically large, with movements involving the entire arm. 

The two groups likewise varied in the area in which i^csiicuUnion occins, 
the Jewish group seldom deviating from the medial plane of the body, 
whereas the Italian is more likely to perform his movements within the 
lateral areas. 

Within each of these general areas, a dilTercncc was found in the dnre- 
lion of the gestural movements themselves, the Jewish movements being 
more frequently toward, and the Italian away from, the body of the 
gesture r. 

Significant differences were likewise noted in rhvfhiu or ttnipo, the 
Jewish movemenis being characteristically jerky, sporadic, and variable, 
while those of the Italians aie more even and less variable. 

1 he frame of rcicrencc cl the gestures also dilfered. The Jewish gestuies 
are more likely to he direcied toward the body ol the person addressed as 
a "/7o//// dc rcpcic," the speaker trequenily touching the auditor, or liter- 
ally ‘'hutlonholiiuf’ him. In contiast, the Italian gestures are typically 
oriented around (he body of the speaker as a frame of reference 

In addition to these spatio-temporal characleristies of the gestural 
movements themselves, certain major differences were obNcrved in icgard 
to tlie meaningful or linguistic function ol such gestures, fhe Jewdsh ges- 
tures were chaiacterislicaily of the discuisivc or logical type, being, as it 
WTre, a gestural poriraval not of the object of rekrence or thouglu, but 
of the process of ideation itself. 1 his discursive or logical type is absent 
among the traclitiona! Italians, whose gestmes are 1rec}iientlv pictorial or 
paniomnnic. the latter being a sort of re-cnaelrnent imitation of the 
actions \ei bally described. Purely symbolic gestures are also common 
uniong tne traditional Italian, and convey definite meaningful associations. 
These may he used to accompany verbal intercourse or may even function 
as the evciLisive means <if comnumication. 

The second major point brought out by this investigation was tliat 
all the abo\'e characteristics of the traditional Italian and Jewish 
groups tended to disappear in the ‘'assimilated” groups. In general, the 
more assimilated the individual, the less his gtssltiral chaiacteiistics re- 
sembled those of tradilu^nal Jew^ish or Italian groups. traditional 
differences between .Jewish and Italian gestures were absent in the 
fully assimilated groups, and both resembled the particular ‘'Ameri- 
can” group with which they had become associated. On the whole, 
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gesticulation was much less jrequent in such assimilated groups. The 
dilTercnccs in gestural behavior between traditional groups and the 
lack of such diilercnces between assimilated groups could not, furtlicr- 
more, be explained on llie basis of native or foreign birth. It was 
found, for example, that the American-born students at an orthodox 
Jewish school in New York City exhibited the gestural behavior of 
the traditional groups obseived in the lower Hast Side, while the 
American-born Jewish subjects obtained at an exclusive Fifth Avenue 
club showed no such traditional gestures. In summary, a marked dis- 
parity was found between most of the gestural patterns ciiaractcrislic 
of the traditional Jewish and Italian groups investigated, but no such 
contrasting gestural patterns were noted in assimilated groups of the 
same “raciak' extraction. I bus cultural stimulation or habituation, 
ratlier than so-called racial descent, seems to be operative in the 
development of geso.ire. 

CONCLUDINO r^VAl.UAllON 

In the two preceding chapters v\e lune noted the many dinicultics 
which beset the study of race dilTerences in psychological traits. Race, 
deiined as a biol<’)gically distinct group dillcicntiaied by common in- 
nate physical characteristics, is a dillicult category to apply to con- 
temporary man. In the attempt to arrive at a eha sihcaiion of human 
races, one proposed criterion after another has proved inadequate. An 
analy^is of the major alleged physical diilerentia of race reveals wide 
varialicm within a single ^rou[), overlapping of groups, inconsistency 
with other criteria, and susceptibility to eiivironiiiental iniluenccs. One 
or more of these criticisms can be ! veled against each of the proposed 
criteria. Thus even the best posi ide elassilication of races is to be 
regarded as tentative and approximate In fact, the very concept of 
race could be questioned on both theoretical and empirical grounds. 

Race mixture, which \v\< been going on for many generations, also 
adds to the complexity of tlie pioblem. I'he issue is further confused 
by the testing of immigrant group- v’. iiich may not be representative 
samplings of iheir naiioiial populations. Moreover, immigrants are 
likely to be undergoing a period of intense readjustment and conflict 
arising from their coiUaets with the new culture, and lliis cannot fail 
to alTccl their Ivdiavior in many wavS. 

The problem of testing and comparing racial groups also presents 
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serious difficulties. Members of different races usually differ in many 
other respects as well. These differences often make direct comparison 
of behavior impossible. Thus language handicap has been shown to 
have a marked influence upon mental test performance. The subject's 
reaction to an examiner of a different race, the establishment ot “rap- 
port,” the use of pantomime or of pictures which may not be equally 
familiar to all groups, all make the administration of tests a difficult 
task. The racial groups to be compared, furthermore, may not be 
equated in educational opportunities t nd facilities, socio-economic 
status, and the general cultural milieu in which they live. The special 
traditions, customs, and interests characteristic of each group may 
further “interfere” with test responses. Finally, it is impossible to 
establish a hierarchy of groups in terms of absolute intellectual supe- 
riority or inferiority. “Intelligence” tests measure certain abilities re- 
quired for success in the paiticular culture in which they were 
developed. Cultures differ in the specific activities which they encour- 
age, stimulate, and value. The “higher mental processes” of one 
culture may be the relatively useless “stunts” of another. 

In so far as the members of different races live under vaiied cul- 
tural conditions, it is extremely difficult to compare thetff directly and 
impossible to determine the relative contribution of hei editary and 
environmental factors in producing any behavioral difleiences among 
them. In a few invcstigatio «s, which have been reported in the present 
chapter, it was found possible to make cros^-\.omparisons among racial 
and cultural groupings. In so far as these two categories, lacc and cul- 
ture, cut across c 'ch other, it is posable 1o tease out the relative iiillu- 
cnce of biological and environmental factors. The results ol such 
investigations arc highlv suggestive. 

It would be prermituie, of course, to hazard any conclusive state- 
ments on so complex a problem, tui the bulk of the evidence is defi- 
nitely against the existence ol beh<ivior<il differences among “laces” 
in the biological sense. It is misleading to conclude that to date inves- 
tigators have incKly /t (h'd to ptoxe race dfflcrenees in behavior. The 
present state of our kncjwledgc on this question is nc'l a complete 
blank; nor is the evidence perleclly balanced, with half the data 
favoring a racial hypothesis and halt a cultural hypothesis. It is a fact 
that there are group differemes in hehaxior, but not tnat such differ- 
ences are racial or biological in origin. There is a considerable body of 
data, both in the racial studies and in other more general investigations 
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on the origins of individual differences in behavior, to show the influ- 
ence of environmental factors in behavior development. But no study 
has conclusively demonstrated a necessary association between be- 
havior characteristics and race as such. 

To determine whether or not a behavior difference is truly racial 
logically implies three questions First, is the bcha\ ior difference under 
consideration traceable to a structural difference'^ If so, is this struc- 
tural characteristic gene-determined, i.e., not the result ot dietary fac- 
tors, birth injuries, or other environmental conditions? If both of the 
above questions arc ansv.cred allirmatively, the fmal question is: Can 
a linkage be demonstrated between the genes determining this struc- 
tural characteristic and the genes determining such racial characteris- 
tics as skin color, cephalic index, hair quality, and other commonly 
used criteria of racial classification'^ A negative answer to one of 
these three questions precludes a racial interpretation ol the observed 
behavioral difference. 




CHAPTER 

23 


Socio-Ecouom ic 
Differences 

One of the principal shortcomings of most efforts to describe or 
understand “national character” is their tendency to gloss over impor- 
tant diflerenccs among cultural sub-groups within a nation. Moreover, 
the accounts arc sometimes based, not upon the common features of 
the national culture, but upon an ovcrgencralizcd picture of the par- 
ticular sub-group with which the investigator was most familiar. In 
America, such broad regions as the New England States, the South, 
or the Midwest will be readily recognized as differing in more than 
geography. 

1 he distinction between “city” and “country” is likewise a familiar 
one. Even the casual observer is aware of significant differences be- 
tween the urban and the rural dweller, not only in abilities, but also 
in interests, emotional responses, and general outlook. Actually this 
division is not a twofold one, but includes a series of groups, each dif- 
fering from the others in distinct ways. From the large metropolis, 
through the moderately huge city, the small town, the village with its 
one general store and post office, to the open country and the isolated 
mountain community, there are t( be found numy degrees and types 
of variation. The extremes of this series present definitely contrasting 
psychological pictures. Among the iiitci mediate and more nearly adja- 
cent members, there may not be a very pronounced intellectual 
variation, but in such cases well-known personality differences are 
often found. Thus the attitudes and emotional traits of the isolated 
mountain dweller and of the inhabitants of a small village may be 
fundamentally diverse. Similarly, between the resident of a large city 
and the member of a small town community there exist differences in 
outlook which have been repeatedly described and dramatized in 
literature. 

Another kind of cultural grouping whose importance is receiving 
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increasing recognition is that represented by social classes. Sociological 
research in American communities has demonstrated not only the 
well-nigh universal prevalence of such social stratification, but also 
the profound effect which the individual’s class membership may have 
upon his behavior development. The chief difference between a rigid 
“caste system” and the class systems found in a democracy such as 
that of the United States is the greater degree of “social mobility” 
possible in the latter.^ Thus it is possible for the individual in a lower 
social class to rise to a higher status thro^igh his own efforts. It is this 
possibility which is at the root of many of the characteristic motiva- 
tions and attitudes of the “middle class,” with its emphasis upon hard 
work, self-improvement, and attainment. 

An interesting practical application of the concepts of social strati- 
fication and class status is to be found in industry. The modern indus- 
trial psychologist recognizes that the plant personnel is structured 
into status groups or classes, in much the same way as any other com- 
munity (cf. 76, 97). Not oiil> occupational titles and wages, but dis- 
tribution of working hours, characteristic wearing apparel and insignia, 
type of chair or desk, and almost any item or event in the working 
environment can become associated with these social disfSnctions and 
thus serve as a “prestige symbol.” Any change which threatens to 
disrupt the individuaks position in such a prestige scale may have a 
very demoralizing clTcct. 

C LASS S'lTUfCTUKL AND PSYCHOLOGIC At. DLVLLOPMENT 

The class differentiatiem of American society has been vividly demon- 
strated in a series of sociological studies conducted in certain typical 
American towns. These towns have become familiar by the pseu- 
donyms given to them by the investigators: from the Middle West we 
have Middletown (58, 59), Plain\illt\ LLS.A. (98), and Prairie City 
(38 ) ; from New England, Yankee City (94, 95 ) and from the South, 
Old City (16). In ail these studies, the research meihcxls employed 
were similar to those developed by social anthropologists in their field 

^ It has been pointed out lhat tJie social position ocLiipicd by such ethnic minority 
groups as the Amcucan Ncgio is nioic naaily lhat of a caste than that of a class. 
Even within such a ‘'caste,” howevci, a fiirlhci class stiatiHcution based primal ily 
upon socio-economic factors is found (cf. ib, 19). 

^The entire “Yankee C'lly Senes” comprises six volumes, four of which have 
appeared to dale (94, 95, 9(i, 97); only the iiist two volumes aic primarily con- 
cerned With class differentiation. 
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studies of preliterale cultures. The investigators lived for an extended 
period in the particular town, taking part in its social activities. Many 
local residents of varied socio-economic level were interviewed and 
their behavior in social situations was observed. By such techniques, 
information was obtained not only on the prevailing class concepts 
and criteria, but also on the social status of specific persons in relation 
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Fig. 96 Social Stratificaliop. in an American C ommunity: "Yiiiikee City." 
(From Warner and Lunt, 94, p. <S8.) 

to other persons. The invesiigalors wanted to discover “who associates 
with whom,” and in what capacity. It was primarily on the basis of 
such information regardin ' social participation that the status classifi- 
cation of each individual w.is determined. Once this had been done, 
it was then possible to check such characteristics as income, property, 
education, church and club membership, and other factors which 
might dilTcrentiatc the status categories. In general, the results of these 
surveys indicate a stratification into three major classes, each being 
further subdivided into two sub-classes, as shown in Figure 96. The 
per cent of persons falling into each of the six categories within a 
sampling of 16,785 persons investigated in Yankee City is also indi- 
cated in the figure. The relative proportion of persons in each status 
class did not differ substantially in the other towns studied. 

These class distinctions were based largely on occupation and 
income level, although such factors as family background, education, 
beliefs and attitudes, and moral standards provided additional criteria. 
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The distinction between upper-upper and lower-upper in the New 
England and southern towns was made primarily in terms of family 
background, the upper-uppers representing the “old aristocracy,” and 
the lower-uppers the “newly rich.” In the mid western communities, 
this distinction was not generally made, there being only one “upper” 
class comprising the wealthiest and most prominent families. The 
upper-middle class consisted principally of business and professional 
people, the “pillars of society,” while the lower-middle class included 
small tradesmen, “white collar workers,' and some skilled labor. The 
upper-lowers, consisting largely of semi-skilled and unskilled workers, 
were often described by middle-class persons as “poor but respect- 
able” and “hardworking people.'’ In contrast, the lower-lowers were 
characterized as shiftless and disorderly. The upper three classes to- 
gether constituted “the big people” of the town, the lower three classes 
being regarded as “the little people.” 

An interesting by-product of these studies was provided by the 
comparison of class concepts held b> individuals in dilTerent levels of 
the social hierarchy. Figure 97 illustrates these dilTerences as found in 
the population of Old O/v. It \^ill be noted that groups more remote 
from the informant arc sometimes classed together, but finer distinc- 
tions arc made in the vicinity of the individuars own class. The sharp- 
ness of the various divisions also dillers with the group to which the 
informant belongs, as indie ^ited by the number and position of the 
solid and broken lines in the diagram. The most conspicuous finding, 
however, is the frequency with which derogatory terms are used when 
describing other people's classes, and laudatory terms when describing 
one's own. The same class looks very dillcrcnt when view^ed from the 
top, the middle, or the bottom of the ladder! 

Of special interest to the dilicicnlial psychologist are the elTccts 
which social class membership mav have upon the individu d’s emo- 
tional and intellectual development. In his analysis of personality, 
Murphy (67) has maintained that the social classes show distinct 
“psychological cleavage ” or discontinuity, and that these cleavages 
are reflected in peisonality structure. All surveys have corroborated 
the fact that these classes represent distinct cultural units. The type 
and extent of social contact between the various classes is definitely 
restricted. Moreover, the class stratification is rcilcctcd ii large differ- 
ences in home life, education, recreational outlets, reading habits, 
religious observance, and political activity. 
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Also relevant in this connection are the data collected by Kinsey, 
Pomeroy, and Martin (47) on male sexual behavior. On the basis of 
their intensive interviews with 6300 American men, the investigators 
were strongly impressed by the relationship between socio-economic 
level and sexual behavior. For example, the lower socio-economic 
classes report a higher incidence of pre-marital and extra-marital 
sexual relations than the higher socio-economic classes, but masturba- 
tion is more frequently reported in the higher socio-economic levels. 
Upper-class males also respond eroticaliy to a wider range of stimuli 
than do lower-class males. The investigators themselves regard such 
socio-economic dilTcrenccs as one of the basic findings of their survey. 
They write: 

The data now available show that patterns of sexual behavior may be 
strikingly dilTerent for the different social levels that exist in the same 
city or town, and sometimes in immediately adjacent sections of a single 
community. The data show that divergencies in the sexual patterns of such 
social groups may be as great as those which anthropologists have found 
between the sexual patterns of different racial groups in remote parts of 
the woild. Theie is no American pattern of sexual behavior, but scores of 
patterns, each of which is confined to a particular scghient of our 
society (47, p. .129). 

To be sure, these results may reflect no more than the degree of 
willingness or reluctance of American men in different socio-economic 
classes to report certain sexual activities. Even if this is the case, how- 
ever, the data would indicate certain socio-economic differences in 
attitudes toward various forms of sexual behavior. * 

A number of investigators have called attention to dilTerenccs in 
child-rearing praciices among social classes. Davis and Jfavighurst 
(17) studied this question by means of intensive interviews of upper- 
middle and upper-lower class families in Chicago. The interviews 
covered such matters as feeding schedules, toilet training, daytime 
naps, going out alone, hour at which child is required to be home at 
night, and age at which ihe child is expected to assume various respon- 
sibilities. Several statistically significant dilfcrenccs were found within 
both the white and Negro groups studied. The differences were such as 

TTic results of this study should iilso be qualified in the light of certain selective 
factors' which tended to make Ihe samphiigs unrepresentative of iho general popula- 
tion. However, since the reported ditlcicnccs between socio-economic groups are so 
large, it is unlikely that Ihe geneial conclusion cited above would have been sub- 
stantially altered by the use of more highly icpresentative samples. 
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to suggest that middle-class parents tend to be more rigorous in their 
child-training practices, frustrate the child more in feeding and clean- 
liness training, and expect children to take responsibility earlier. The 
authors are of the opinion that these class dilTerences in child-rearing 
practices may alfcet subsequent personality development. Other inves- 
tigators have shown that the Umj^na^e development of children is 
closely related to socio-economic status (cf. 63). 

Davis (14, 15) has repeatedly discussed the many discrepancies in 
the type of training received by children of dilTerent socio-economic 
classes and the possible implications of these inequalities for intellec- 
tual and emotional development. Such dilTerences range all the way 
ftom the eating habits and the type of clothing worn on dilTerent occa- 
sions to the choice of playmates and the individLiaPs educational and 
vocational goals. Davis further maintains tha^ the public scliools arc 
primarily adapted to the middle-class culture, since educational per- 
sonnel is recruited principally from the middle class. This situation, 
according to Davis, makes the curriculum, type of incentives, and 
other aspects of the educational experience provided by the schools 
unsuited to lower-class children. He suggests that thi^ may be an 
important reason f(^r the frequent school maladjustment and educa- 
tional backwardness of these children. The evidence docs show that 
school achievement is positively correlated with socio-economic status 
(cf. 30, 31, 32). 

Surveys by means of personality tests, questionnaires, and opinion- 
polling techniques have tended to substantiate the class dilTerences 
which would be expected on the basis o1 cultural dillercntials. On 
neurotic inventories, school children from lower soeio-ecemomic levels 
have shown more evidence of mala> justment than those from middle 
and upper levels (6, 7. 30). Moreover, these class differences were 
found to be larger and more reliable than dilTerences betw'ccn native 
and foreign groups, or between urban and rural groups, tested in the 
same investigation. When groups of comparable socio-economic level 
were selected, the national and urban-rural differences tended to 
disappear. 

In another investigation (60), children of professional fathers were 
found to be more dominant, extroverted, and emotionally stable, 
whereas children of skilled laborers had more worries. Some data are 
also available which sugi^est possible socio-economic differences in 
adolescent “p^’^^tige factors” and in the attitudes of adolescents 
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toward their age-mates (cf. 1 ). An intensive investigation of 16-year- 
olds in Prairie City (38) likewise indicated the role of social class in 
adolescent character development. 

It should be noted thal, when comparisons are made betv/een se- 
lected groups from different social classes, the class differences may 
be obscured by the differential operation of selective factors (cf. 
Ch. 20). Thus in a comparison of urban and rural college women on 
a personality inventory, no significant difference in total adjustment 
score was found betw^ecn the two groups (75). It may well be, how- 
ever, that those rural girls who go to college are the \ery ones who 
most nearly resemble urban girls in their behavior, and whose cul- 
tural background has been most similar to that typical of an urban 
environment. It is interesting to observe that, despite such probable 
selective factor*!, certain items still showed large differences between 
the two groups. Moreover, the extreme scores, indicative of the best 
and poorest adjustment in the sample studied, were found in the urban 
group. 

The same type of differential selection probably operated in an 
intensive stud} of personality and economic background conducted 
by Davidson (13). Several standardized personality tcsts^verc admin- 
istered to 102 children between the ages of 9 and 12, whose lO’s 
ranged from 120 to 200. Socio-economic status, as determined by 
family income level, showed no significant relation to the large major- 
ity of personality indices employed in the study. Again it may be 
argued thal children who score so high on intelligence tests represent 
a different selection from the upper than from the lower social levels. 
Such children may have been exposed to more nearly similar environ- 
ments than would be true of the different social classes in their entire- 
ties. In this study, too, the differences were reduced but not eliminated 
by selective sampling. Certain characteristics did show a significant 
relationship to incvnne level. Among such characteristics were reading 
preferences and habits, recreational preferences, possession of fears, 
and “liberalism” in social issues. 

A direct and thorough approach to status differences in personality 
is represented by the investigations of Gough (31, 32) on high school 
students. Within a group of 223 high school seniors in a midwestern 
city, two extreme socio-economic samples were chosen on the basis 
of scores on the Sims Score Card for Socio-Economic Status.^ An 

For a further discussion of this scale, cf. pp. 801 802. 
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item analysis, based on the responses of these two samples on the 550 
Items of the Minnesota Multiphasic Personality Inventory, revealed 
34 Items winch yielded significant socio-economic differences (31) 
An examination of these items suggests that students of higher socio- 
economic level show stronger literary and aitistic interests, have more 
social poise, security and confidence in thcmschcs and others report 
fewer fears and anxieties display more “emancipated and ‘frank’ 
attitudes in moral religious and sexual matters, and are inclined to 
be more positive, dogmatic and self-righteous in their opinions 
The 34 differentiating items were grouped into a * status scale,” 
from which the personality status scores of another group of 263 
students were eompiilid 1 hese st it is scores eonelated ^0 with ‘ ob- 
jective status scores bised on chaiaeleitisiies of home biekgiound 
Moieo\er the corrslalions ot the otrsonihlx si itus scores wilh each 
of a number ol olhsi v in ibies clos \\ p ir illeled the pitlein )l c orrela- 
lions ot home status with the same vaiiables Ihe variables with which 
each of these two t\pcs of status scores were eorrehted included each 
ot the other se lies of th^ Minne ota MuUi])hasie Personalit> Invcn 
torv, a^ well as other person ility tests mlelligenec ind achievement 
test 5, and aeademie giadvs (32) Ihese correl itions further suggested 
that studenS of hiahei sos^i U stilus ^^how more satisfactory social 
adjustment less insecinity and kss social intro\eis]c n dian do lower 
status students 1 he compiiison ot person ilit) status and objective 
status scores sugf^ests interestma possibilities for the picdietion ot social 
mob’litv in indiMdud cise thus discrepancies between the person- 
ality status score and cl ctivc status score n i\ be lelated to the 
indivuhi il > Undenev to riSv. or drop m the social h erauh^ I or exam- 
ple, an individual wi^h low objceln a status score but high personality 
status score might be more hkc]> ^ i go to <.ollegL than one with low 
Status scores in both respects If this hypothesis is ventied, it might 
help to expl un the icl uively small personality test diffcrencLS between 
socio-economic groups which are found when selected populations 
are comp irc 1 as m the pievioush discuss^.d studies 

It IS now well known that occu i itional groups exhibit character- 
istic diderenees in inUtc sts not only ip strictly vocational matters, but 
in almost all areas ot evcr\day life activity These ditkrcnees aie, m 
^acl, the foundation upon which such tests as the Strong \ oeational 

Peisoiidl comnuinic ition from Dr H <j Cfoueh Cf ilso H Ci (loueh A New 
Dimension of St It IS 111 D scicp incics between the St Sc aL md Ob|ecti\t Status’ 
Atmr Social R( 1949 U 27^ '^81 
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Interest Blank have been constructed. An even more relevant finding 
reported by Strong (89) pertains to the marked differences in interest 
pattern found between difl’ercnt occupational levels. Strong has devised 
a special scoring key for measuring the occupational level (O. L.) of 
the individual’s interests. This was done by selecting those items which 
differentiated most clearly among such levels. The O. L. score is an 
index of how “aristocratic” the individual’s interests are, or how far 
they diverge from the interests typical of unskilled laborers. Not only 
docs this score differ with the position of the individual’s occupation 
in the socio-economic hierarchy, but within any one occupation it tends 
to be higher for those men whose work is of a managerial character. 

Some of the largest and most consistent class differences have been 
found in attitude surveys. Nation-wide polling studies (11, 55 ) as well 
as more intensive investigations in local areas, such as New Haven, 
Connecticut (64), and Akron, Ohio (43), agree in finding higher 
socio-economic level to be associated with more conservative atti- 
tudes, and lower socio-economic level w ith more radical attitudes. As 
one might expect, individuals who already occupy a more favored 
position in the social ladder tend to favor the preservation of the status 
quo. In general, loo, middle-class persons are more co?icerned with 
advancement along vocational and other lines, while the lower classes 
emphasize security (64). 

One of the most carefi ily controlled surveys on the attitudes of 
different social classes was conducted by Centers ( I 1 ). A representa- 
tive cross-section of the adult white male population, totaling 1100 
persons, was interviewed during the summer of 1945. The interviews 
covered attitudes with respect lo various major economic and social 
issues, as well as with respect to class identification. Included was a 
battery of six questions designed to test the respondent’s conservative- 
radical orientation on socio-economic issues. On the basis of the replies 
to these six questions, individuals were classified into five categories 
in reference to expressed conservatism-radicalism. In Figure 98 will 
be found the relative frequency of these five response categories among 
individuals of different occupational levels. Separate results are given 
for urban and rural samplings. The occupational differences are large 
and clear-cut, the author concluding that such differences leave little 
doubt that people’s politico-economic orientations an closely asso- 
ciated with their socio-economic statuses. Another interesting observa- 
tion was that, within any single occupational category, those persons 



Socio-Economic Di fferences 


797 


who subjectively identified themselves with the ‘‘working class” ex- 
pressed more radical attitudes than those who classified themselves in 
the “middle class.” 

In view ol the large diflcrences in the traditional activities, motiva- 
tions, and attitudes of the various socio-economic classes, we would 
expect certain concomitant difi'ercnces in intellectual development. In 
actual fact, nearly every investigation in which intelligence tests have 
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Fig. 98. Attitude Differences of Occupational Strata with Reference to 
Politico-Economic Conservatism-Radicalism. (From Centers, 11, p. 58.) 


been administered to persons in different socio-economic levels has 
shown differences in the same direction. The trends are exceptionally 
consistent. In the sections \^hich lollow, we shall examine typical 
results obtained w^hen intelligence test scores have been analyzed with 
reference to occupational categories, as well as with reference to other 
characteristics of social class. We shall also consider specifically some 
of the findings on relatively isolated groups, such as mountain dwell- 
ers, and on urban and rural populations as a wnole. 


OCCUPATIONAL IFV LL AND IN TELLlCiENCL 

The first large-scale survey of the intelligence test performance of men 
engaged in different occupations was provided by an analysis of the 
Army Alpha scores obtained in World War I. On the basis of the 
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scores of about 18,000 men, the mean and range of Alpha scores 
were computed for 96 major occupations (24). The results fell into 
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Fig. 99. ACjCT Scoie in Relation to Civilian Occup i+ion (Data from 
Stew lit, 8S, pp 5 I'i ) 


a distinct "'ocLiipational hierarchy/^ with the highest scores in the 
professional gioups and the lowc>t among unski led laborers Although 
overlapping was large and mean differences between adjacent groups 
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were slight, the differences between occupational groups in different 
portions of the hierarchy weie large and statistically significant. 

A corresponding analysis has been earned out with the AGCT 
scores obtained in World War II (88). Data on fifteen common occu- 
pations, selected from difieicnt parts of the liierarchy, arc portrayed 
in Figure 9Q. In the complete analysis, similar data are provided for 
211 occupations which were represented in sufficiently large numbers 
to yield reliable information. 1 he results were based on the scores of 
81,553 white enlisted men. derived Irom a random 2% sample of 
army personnel.'' Because of selective factors operating in deferments, 
rejections, and discharges, such army samplings cannot be regarded 
as rcpiesentative of adult cKihan pc/pulations. Moreover, no olficcrs 
were included in the occupational survey, llius further restricting the 
distribution at the upper levels. The icprcst^ntation of professional 
groups is especiallv limited by these factors. Dalti on doctois and engi- 
neers, for example, are virtually non-existent in such tabulations. 
Despite these limitations, it is clear that the intelligeikc lest scores fol- 
lowed the general occup<ilional hierarchy, paralleling the socio- 
economic Status of the van nis groups.’^ 

It may be of interest to note parenthetically that the intelligence test 
rank of those occupjUons which could be directly compared in the 
World War I and VVorld War II figures correlated .84 (88). On the 
whole, the hiemrehy icmained substantially the same over the twenly- 
fiv^e-ycar period. None of the major occupational dillcrenccs was 
reversed. Miny of the small shiffs which did occur, however, could 
not be attributed to chjn<.^ fiuclualicms of sampling, but represented 
real differences in the populations tested during the tw'o wars. A multi- 
plicity of conditions, both in the ar; y situation and in society at large, 
could account for such >hilts in rchuive occupational inFdligence. One 
possible factor may be found in shifts in the prestige value and in the 
skill requirements of certain jobs, as a result of technological ad- 
vances. For our present purpose, however, it is the consi^tcncy of the 
major occupational dificrenccs, rather than the minor shifts, which is 
of primary concern. 

^ The sample was a completely random sample of the entiic army personnel 
In the ocL'paional analysis, ofTitcrs, enlisted women, and non-white enlisted men 
were not included, because * f small number of cases or lack ot ACjCT scores As a 
result, the cases actually used represented appro\imalely 1.6^ of the told army 
population. 

A similar analysis conuucted by Harrell and Harrell ('’V) on Army Air Forces 
personnel yielded results vhich are in siibstmlial agreement wuth those obtained in 
the larger and more representative sample discussed above. 
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SOCIAL STATUS AND THE INTELLIGENCE OF CHILDREN 

The correspondence between intelligence and occupational status is 
by no means limited to adults, but is also found among the children 
of men engaged in different types of work. Thus the relationship can- 
not be attributed primarily to differences in vocational experiences 
and in amount of formal education. More general conditions must be 
involved, which characterize not only the men, but also their families. 
The differences persist even when children in the same school grade 
are classified according to the occupations of their fathers. Such find- 
ings have been obtained in a large number of investigations, from the 
preschool to the college level.'^ In Table 59 will be found an occupa- 
tional analysis of data collected during the standardization o** the 1937 

TABLE 59 Mean Stanjord-Binet IQ's of 2757 Children Classified 
Acvoiding to Paternal Occupation 
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revision of the Stanford-Binet (66). These results, based on one of 
the largest and most representative samplings of American children 
ever tested, are typical of those found by other investigators with a 
variety of intelligence tests. In general, there seems to be a difference 

® Many of these studies have been summarized by Neff (68) and by Loevinger 
(^ 7 ). 





Socio-Economic Differences 801 


of about 20 points between the mean IQ’s of the children of profes- 
sional people and those of the children of day laborers. 

It should also be noted that these differences in IQ are just as con- 
spicuous at the youngest age level (l-SVz) as they are at the oldest 
(15-18). This finding is supported by other investigations within 
these age ranges (34, 71, 72). Moreover, studies on the intelligence 
of preschool children have revealed a similar relationship with paier- 
nal occupation. This is illustrated in Table 60, which gives the mean 
Kuhlmann-Binet IQ’s of 380 children between the ages of 18 and 54 
months, classified according to father’s occupation (28). This group 
was retested within about six weeks, the results of both tests being 
shown in Table 60. 1 he group differences were even larger on the 
retest than on the initial test. 


TAMvE 60 Mean Kulilnuinn-Binct }Q's of 3S0 Preschool ChUJren 
Classified According to Paternal Occupation 

O'lnni ( In(»<U'iioir‘b, |) 2.S7) 
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A number of inves'igalors hav, employed scales for rating the 
socio-economic level ol the home, thus permitting the computation of 
the correlation between each individual's intelligence test score and 
socio-ecomnnic level. Most of these scales involve visits to the home 
and interviews with parents, in order to obt.-.in information for rating 
a number of home conditions.'' On • of the earliest of such scales was 
the Whittier Scale for Grading Home Conditions (101). Another, 
more restricted in scope but correlating highly with other scales, is the 
Chapin Living-Room Equipment Scale (12). The Sims Score Card 
for Socio-Economic Status (84) is a questionnaire covering cultural, 

'■'For a survey of such sc.Ucs, cf. Leahy (SI) and Loevingcr (57). 
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economic, educational, and occupational status of the family, to be 
tilled out by the child himself. One of the most comprehensive and 
carefully standardized scales is the Minnesota Home Status Index, 
devised by Leahy (51). This scale yields a “home status profile” in 
terms of sigma-scores on each of six measures: children’s facilities, 
economic status, cultural status, sociality, occupational status, and 
parental education. Each of the first lour measures is based on from 
eleven to thirteen questions asked of one of the parents. 

It should be noted that Ihe correlation between socio-economic 
indices and intelligence is likely to be curvilinear, since the distribu- 
tion of intelligence test scores is approximately normal, while the dis- 
tribution of socio-economic level is quite skewed, with a piling of 
cases in the lower portion. Consequently, computation of the usual 
Pearson coefficient ol correlation between these two variables will 
underestimate the relationship between them. A few investigators 
have computed eta, or the correlation ratio, for this reason. Another 
solution is illustrated in the Lpi\cisiiy of Caliiornia Socio-Economic 
Index (4), in which family income is transmuted into a logarithmic 
scale, yielding a more ncarl) normal distiibution; the transmuted in- 
come measure is then combined wuth amount of paiental education, 
occupational level, and a composite rating of home, living room, and 
neighborhood. 

Rather than treating socio-economic level as a unhary variable, 
some investigators have stressed the importance of determining whut 
.specific features of the ciniromnent are associated with intclfigcnce 
test performance. In this connection, attempts liavc been made by 
Van Alstyne (93), Skodak and othcis to design scales which 

cover the psychologically moie significant aspects of the child’s environ- 
ment, such as parent-cliild contacts and the opportunities for various 
types of activitv. A weakness of the^e scales is the role of subjective 
factors in the original choice of variables, as well as in the ratings 
themselves. However, they suggest interesting possibilities for further 
research. 

It is apparent that the available indices of socio-economic level vary 
considerably in the aspects of environment which they measure. In 
view of the fact that the tests used to measure intelligence have also 
varied from one investigation to another, as has the choice of sub- 

cp, the perceiit.ipc of people in each of the social classes shown in 

Figure 96. 
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jccts, it is not surprising to find a wide range of values given for the 
correlation between ‘'socio-economic level” and “intelligence.” Be- 
tween the ages of 3 and 1 8 years, most of these correlations are in the 
vicinity of .40, although some arc as low as .20 and some slightly 
over .50 (cf. 57, 68). The correlations show no consistent age trend 
within these age limits. Below age 3, the coi relations drop (4, 41), 
and between birth and 18 months they arc generally zero or slightly 
negative (4, 25). Ft should be recalled that psychological tests for 
infants are largely measures of simple sensori-motor development. 
As the child glows older, the tests become increasingly verbal and 
abstract in content. Since dillcreiit functions arc tested, there is thus 
no real inconsistency between the correlations obtained on infants and 
those on older children. On the whole, studies employing the correla- 
tion technique have corroborated the cruder results obtained by the 
comparison of occupational groups. 

Approaching the same problem from a slightly dilTerent angle, 
Havighurst and his co- workers (36, 37, 42) made use of the ''soiial 
.stcirus metinnr discussed in the first section of the present chapter. A 
midwestern community with a population of about 10,000 was 
chosen, and the families placed on a scale of live social classes by the 
prexiously described methods. Psychological tests were admiuKtered 
to nearly complete samplings of children at three age levels, 10, 13, 
and 16. The 10- and 16-year-olds were given well-known intelligence 
tests, including verbal and pertornuince bpes, as w^ell as specialized 
tcTs of reading and of sp.dial and mechanical aptitudes. The 13-year- 
old group was tested wiiu the Ihuistone 1 ests ol Primary Mental 
Abilities. The mean scores of the diderent status groups on each of 
these tests aie si.mmari/ed in lai s 61, 62, and 63, shown on the 
two following page ,. 

These three surveys are somewhat limited by the small number of 
cases available at each status level. The representation of the two 
highest social classes (A and B) was especially inadequate: in the 
10- and l3-ycar-old samples there were to./ few A or B children to 
warrant the inclusion of these categv . in the statistical analyses; and 
in t!ie 16-ycai-old sample there were only 9 children in both classes 
combined. Despite the small number of cases, nearly all tests in all 
three samples show a Icndencv for .scores to rise with social status. 

^K'alled Micf\\('\f in thvst aiiiclcs but elsewhere designated by llie pseudonyms 
of Prauic City ( ^S) anil / (40^ 



TABLE 61 Mean Test Scores Obtained by lO-Yeur-Old Children in Different Social Status Groups 
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’*-This group was siuciied later than the i0'\eai-olds and l6-> ear-olds. 



TABLE 63 Mean Test Scores Obtooied by Ib-Yiar-Old Children in Difjercnt Social Status Groups 
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To be sure, some of the differences between adjacent status groups are 
insignificant and a few arc even reversed; but when extreme groups 
are considered, most of the differences are statistically significant. 

The most clear-cut exception to the above trend is found in the 
Minnesota Mechanical Assembly Test given to the 16-ycar-old boys 
(Table 63). The mean scores on this test showed no consistent trend 
in relation to social status, and the highest mean was obtained by the 
lowest social group. A possible reason lor these discrepant results is 
that the lower-status boys may have had moie experience in dealing 
with mechanical objects and may thus have been more familiar with 
the tasks involved in the test. Direct comparisons among different tests 
arc complicated by the fact that their units m.iy not represent com- 
parable steps, even when expressed as lO's. It is nevertheless possible 
to detect a tendency for stains differences to be more conspicuous in 
the more higlilv \ethul txpe of tests. For example, among the I6-ycar- 
olds, the mean difference between extreme status groups in Wechslcr- 
Bcllcvue Perform jnee 10 vas only 15 points, as compared to 30 
points in the wStanloid-ninet. The critic il mtios were 2.4 and 4.1 for 
the former and latter differences, respective’y. Moreover, the position 
of groups D and L is re\ctse(l in the Performance IQ. wh^le all groups 
vary in a conastent direction on the Slanfoid-Binct. Siini! irly, among 
the lO-ycar-olds, the me<in 10 diif.renccs betv\ecn extreme status 
gioLips are 23, 20, and 1(> fQ poi-ils on the wSlanford-Binet, Conicll- 
Coxc, and Goodenough tests, respecti\e!y. Altliough the status diflcr- 
cnces on the dillerent teds arc more nearly unifoim among the 
10-year-olds than among the l(vyLar-c'! N. c\en in tec }oon ^er g-onp 
the Stanfoid-Binct lends to show larger critical ratios than do the 
peiformance scales. 

It muhl be noted parenthetically that both age and sex compa^'is^ns 
in this investigation piovide interestinir coiroboration of some of the 
points discussed in the chapter on trait orgaii'/ation (Gh. I ' Tlmu 
among lO-vcar-olds, all tests corre lut».,d faiily Ir’ghly and uniforinly 
with each olhir. Among the l6-year-oH bo\s, mechanical apMtudc 
appears to have become diffeicntiatcd as a special aptitude, but this 
is not true for the 16-\ ear-old (tirls. The status differences al^'O fit in 
with these developmental chang^cs. Tlius among the J 6-) car-old boys, 
mechanical aptitude did not show the same status difT-'rences as did 
the verbal tests. Among the girls and among the younger children of 
both sexes, the status differences in mechanical aptitude w'cre more 
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nearly similar to those in verbal tasks. This would be expected, since 
mechanical aptitude was not so clearly differentiated from verbal apti- 
tude within these two groups. 

In the fhurstonc tests administered to the 13-year-olds, it is again 
apparent that status dillcrenccs vary with the function tested. The 
mean scores of the status groups are shown in Table 62. None of the 
differences between groups C and D was significant at the .01 livel 
of confidence, although the largest critical jatio (2.0) was found on 
the Verbal Comprehension test. When level L is compared with either 
C or D, the critical ratios for the Number, Verbal Comprehension, 
and Word Fluency tests range from 2.8 to 4.9, while those for the 
Spatial, Reasoning, and Memory tests range from 1.5 to 2.5. The 
same discrepancy is indicated by the correlations between score on 
each of the six tests and ‘‘index of status characteristics.” The latter 
is a composite index Dased upon occupatii)n, source of income, house- 
t\^pe, and community-area in which home is located. The investigators 
report that in Midwest this index agreed closely with the social posi- 
tion of individuals as found by the more elaborate social status 
method. The correlations of the status index with scores on each of 
the Thurstonc tests weic as follows: 


Number (V) 

'^2 -t .10 

Verbal Comprehension (F) 

.4? iL .09 

Spate ( S ; 

25 -H 10 

Woid } liicncv (H ) 

.^0 ±: .10 

Rcasomag (/?) 

.2^ .10 

Mcmoiy 

21 ± .10 


It will be seen that the correlations with the Number, Verbal Com- 
prehension, and Word Fluency t- >ts are higher and more nearly 
significant than those with the rema ning three tests. The in\cstigators 
suggest that the correlations with social status arc higher in those 
abilities which might be favored by a superior social environment. 

Some investigators have demonstrated a correspondence between 
the socio-economic ratings of a whole comnnmity and the mean IQ of 
its children. In a study of over 300 icighborhoods in New York City, 
each with a population of about 23,000, Mailer (61 ) found a correla- 
tion of .50 between economic status of the neighborhood and mean 
IQ of its school children. Economic status was determined from fed- 
eral census data regarding value of home rentals in the neighborhood; 
children’s IQ’s were based upon a battery of group tests, including 
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the National Intelligence Test and the Pintner Survey Test, given to 
over 100,000 fifth grade public school children. Working with even 
broader units, Thorndike and Woodyard (92) report very high corre- 
lations between the mean National [ntelligence lest scores of sixth 
grade pupils and various social indices obtained for 30 cities. For 
example, the intelligence test scores correlated .78 with the index of 
per capita income for each city, and they correlated .86 with a com- 
posite index of the general "‘goodness” of community life, based on a 
variety of criteria. 

Mention may also be made of studies conducted in oilier countries, 
all of which demonstrate the same correi pondence between intellec- 
tual and socio-economic variables. Whether children are classified into 
a few categories on the basis of paternal occupation, or whether more 
precise socio-economic indices arc correlated with intelligence test 
scores, the results closely corroborate those obtained on American 
children. Comparable data have been obtained on large groups of 
subjects from early infancy to high school age in such countries as 
England (10, 21, 65), Scotland (23), Poland (69), Rumania (70), 
the Soviet Llnion (20, 85), and Hawaii (56). 

In all these comf)arisons of intellectual and socio-economic varia- 
bles, we must not lose sight of the wide range of individual diTerenccs 
within each level, nor of the related fact of i)verlr,p;>ing between levels. 
'Fhe fact that correlations between individual test scores and individual 
ratings for socio-economic factors fall far sliort of 1.00 is just another 
indication of this overlap. It is also w'cll to remember in this connec- 
tion that the total number of persons in the lower socio-economic or 
social-status classes is larger than that in the upper levels. The result 
is that if we begin with intellectual rather than with socio-economic 
categories, we may lind that a larger percentage oi intellectually supe- 
rior persons come from the lower than from the upper social classes. 
For example, in a survey of more than 1()(),()()0 high school seniors in 
Wisconsin, the investigators report the occupations of the fathers of 
those students who fell above the group median in intelligence test 
score (8). Within this sub-group, only 7.9% had fathers in the profes- 
sions, while 17.4% had fathers in skilled labor. This was true despite 
the fact that the median percentile score of all subjects with profes- 
sional fathers was 68.5 and that of all subjects with fathers in skilled 
labor was 51.1 . These findings arc by no means peculiar to this study. 
Similar results would be obtained in nearly every study, if the data 
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were expressed in the same form. 1’his simply means that the lower 
socio-economic classes may contribute more individuals of fairly high 
intelligence than the upper classes, although relative to the total 
number of persons in each socio-economic class, the contribution of 
the higher socio-economic classes is greater.^- 

A word may also be added regarding possible interpretations of 
the relationship between socio-economic factors and intelligence. The 
association per se does not, of course, provide any clue regarding 
causation. On the one hand, it can be argued that the intellectual 
diiTercnces found today among social groups testify to a gradual 
hereditary dilfcrentialion which has been going on through selection. 
1 hus the more intelligent individuals would gradually work their way 
up to the more demanding hut more highly coveted positions, each 
person tending eventually to ‘'find his level.” Since intellectually 
superior parents tend io have intellectually superior oflspring, the 
children in the higher social strata would be more intelligent, on the 
whole, than those from the lower social levels. A second hypothesis 
would explain the intellectual development of the child in terms of 
the cultural lc\'el in which he is reared. Thus the child who grows up 
in the home of a construction laborer docs not have the same oppor- 
tunities for intellectual development — and consequently will not reach 
the same ability level — as a child of equal initial capacity brought up 
in the home of an eminent scientist or author. A third possible 
hypothesis is that the relationship between socio-economic and intel- 
lectual variables is indirect rather than direct. Thus both sets of 
variables may be related Uirough some other factor, such as person- 
ality characteristics, national origin, or family size (57). 

We cannot choose among these 1 N’potheses without probing further 
into the particular circumstances iU each case. Some investigators 
have been impressed with the finding that class ditfercnces in intelli- 
gence test performance appear so early in life and are practically as 
large among 3-ycar-olds a.s among 1 8-ycar-olds. This has frequently 
been regarded as evidence for a hereditary iriterpretation of class dif- 
ferences, on the grounds that cnv.^» nmcntally produced differences 
should increase with age, as environmental factors have more time 
to operate. It is impossible, however, to make a universal generaliza- 

Beyond a certain point, the laigcr N in the lower classes would no longer be 
suflicient to counteract the shrinking propoiiional contribution. This was true, lor 
example, in Terman’s group oT gilled children, in which had fathers in the 

professional class, as compared to 11.8". in the skilled lahoi class (cf. Ch. 17). 
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tion regarding the relation of age to environmental influences. The 
trend may well be opposite in different situations. If the environ- 
mental differences continue to an equal degree or increase with age, 
then we should expect their differentiating effects on behavior also 
to increase. But if any equalizing influences are introduced into the 
environment at certain ages, as at the time of school entrance, these 
might counteract the divergence of behavior development otherwise 
expected. 

A comparison of class differences in various types of ability may 
throw some light upon the origin of such differences, especially when 
the results are examined m the light of cultural dissimilarities among 
the groups. In the sections which follow, additional data hearing upon 
these questions will be presented Irom investigations conducted on 
relatively isolated groups, as w^ell as on in ban and lural populations. 

Tin: INTI 1 1 v( TLxi nrvi i opmi nt or isolatfd groups 

Certain groups have been of special intercNt to psychologists because 
of their relative isolation fiom outside social contacts. One of the 
most widely quoted <‘liiclies on such isolated group, is that conducted 
by Gordon (29) on canal-boat and gypsy children in hngland. Gor- 
don’s report, made in the course of his (ffhcial duties as Inspector of 
Schools, was based on the vSlanforci-Binct IQ’s and educational test 
scores of various groups of children whose schc'>oling was deficient. 
The iiuicil-hocit cliihlicn were enrolled in special schools maintained 
for them, which they were able to attend only while the canal boats 
were tied up for loading or discharging. It was estimated that the 
average school aUcnclancc of these canal-boat children was only 5% 
of that in ordinary elementary schools. The majoiity were able to at- 
tend school only about once a month for one or two consecutive half- 
days. Their home surroundings, although satistactory in respect to 
conditions of health and cleanliness, w^ere intellectually of a very low 
order. Many of the adults were themselves illiterate, and each family 
Jed a relatively isolated existence, with a minimum of social inter- 
course. 

The average 10 of the entire group of 76 canal-boat children was 
69.6. Taken at face value, this would suggest at bc^t a borderline 
group, with a few distinctly feebleminded individuals. Further analysis 
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of the data, however, brought out the fact that IQ declined sharply 
with a^e within the group, the 4- to 6-year-olds obtaining an average 
IQ of 90, while the oldest group (12 to 22 years) averaged only 60. 
The correlation between IQ and age was —.755. Even when children 
in the same family were compared, a consistent drop in IQ from the 
youngest to the eldest sibling was noted. Moreover, the mental ages 
of children within a single family tended to be very similar, even 
though their chronological ages differed. Such a mental age might 
well represent the limit of intellectual development which was made 
possible by the available educational opportunities and the type of 
home environment furnished within the given family. 

The results on the i^ypsy children were similar but less extreme 
than those on the canal-boat group. The mean IQ of the 82 gypsy 
children was 74.5, and the correlation between age and IQ was 
— .430. Thus both tiu total inferiority and the age decrement in intel- 
ligence were less pronounced in this group than in the canal-boat 
group. Corresponding to these findings is the fact that the school 
attendance of the gypsy children averaged considerably higher than 
that of the canal-boat children, being 34.9% of the total number of 
possible school days. The g 3 n:^sy families led a nomadic existence, the 
children attending school only during the fcvV winter months when 
they had a fixed abode. Although their living conditions were crude 
and primitive, these gypsy cliildrcn had more social contacts outside 
of their immediate family, and were thus less isolated than the canal- 
boat children. It is also notcwortliy that within the gypsy group, IQ 
showed a significant posii 'c correlation of .36S with amount of 
school attendance for each child. It is possible, of course, that the 
relationship between amount of scho» ling and intelligence, both within 
and between these two groups, ma'^ have resulted in part from the 
greater willingness of the brighter children to attend school regularly. 
At least two different lac tors, however, seriously w eaken this hypoth- 
esis. In the first place, it was physically impossible for the children 
to attend school while the eanal boats or gypsy caravans were in 
motion. Secondly, the gypsy children frequently had to be forced by 
local authorities to attend school durmg their brief winter periods of 
stable residence. 

Of special interest is the age decrement reported by Gordon for 
both canal-boat and gypsy children, but not found in surveys of more 
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privileged groups. One possible explanation for such a decrement is 
that the intellectual needs of the younger child can be satisfied almost 
as well in the restricted environment of the canal boat or gypsy camp 
as in a prosperous urban home. As the child grows older, however, 
the differential effects of poorer home environment and of deficient 
schooling become increasingly apparent. Another factor which un- 
doubtedly enters into the obtained results is the well-known difference 
in the functions measured by intelligence tests at the lower and upper 
age levels. The increasing emphasis upon verbal and other abstract 
functions at the older ages might well present a progressively greater 
handicap to children whose environments do not encourage the de- 
velopment of these abilities. To be sure, the data do not in them- 
selves preclude an interpretation in terms of some hereditary struc- 
tural deficiency which might make these particular groups inferior in 
verbal and abstract functions. It might be argued that such a defi- 
ciency would not be apparent at the younger age levels, since these 
functions cannot be adequately tested among young children. 

On the basis of the data presented by Gordon, it is not possible to 
choose conclusively between these two hypotheses. It sliould be noted, 
however, that no hereditary structural basis for verbal aptitude or 
other functions measured by intelligence tests has yet been discov- 
ered, nor docs its discovery appear likely or plausible in the light 
of what wc do know^ reg'^rding the mechanism of heredity. On the 
other hand, there is a multitude of known factors in the environ- 
ments of these children to account for their deficiencies. In fact, it is 
dilficult to sec how any child, whatever his heredity, could obtain a 
normal or superior Stanford-Binct 10 if reared in the environments 
of these canal-boat or gypsy children. 

Studies on mountain children have closely corroborated Gordon’s 
findings. An unusually good opportunity for the study of isolated 
commimilies is offered by the highlanders of our southern moun- 
tains. Owing to poor roads and general inaccessibility, many of these 
groups live in complete isolation during the larger part of the year. 
In certain districts, the cultural level is extremely low, little more 
than the bare necessities of life being available. Families are fre- 
quently found living in the original crude huts built by their ancestors 
several generations ago. Racially these groups are rffatively homo- 
geneous, being predominantly of British descent. They are highly in- 
bred, and in certain communities only two or three different surnames 
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are to be found. The peculiar customs and manners of the southern 
mountaineer have long stirred the imagination of author and play- 
wright. As a result these highland people have achieved a certain 
amount of glamour in the mind of the public, a kind of notoriety which 
overshadows the squalor of their lives. To the psychologist, these 
groups oiler a challenging opportunity to unravel the forces of hered- 
ity and environment. 

Intelligence test surveys of children living m such isolated moun- 
tain communities have been conducted by Hirsch (39) and Asher 
(2) in Kentucky, Sherman and Key (82) in the Blue Ridge Moun- 
tains, Edwards and Jones (22) in Georgia, and Wheeler (99, 100) 
in Tennessee. Tlic results of all these studies are quite uniform. Aver- 
age JQ is clearly bGow the national norms; the inferiority is more 
marked on verbal tests, such as the Slanford-Binct and National In- 
telligence Test, and less marked on non-language and performance 
scales; and the same type of age decrement reported by Gordon is 
found among these mountain children. 

Typical results from the study cimdiicted by Sherman and Key 
(82) are shown in Table 64. The subjects included 102 mountain 
children living in four ‘iic^lows” in the Blue Ridge Mountains, ap- 
proximately 100 miles from Washington, D. C., as well as 81 chil- 
dren living at Briarsville, a small village situated at the base of the 
Blue Ridge. These subjects represented over one-half of all children 
living in the five centers. Each of the five communities dilTcred in 
length of school term, quaihv of schooling, and general level of ma- 
terial culture. Racially, however, the subjects were quite homoge- 
neous, all being descended from a common ancestral stock. It was thus 
possible to make intercompari'^ons a nong the groups, in addition to 
an evaluation of scores in terms of urban norms. 

It will be noted from Table 64 that both village and mountain 
children fall below the ‘htormak’ IQ of 100 on nearly all tests. The 
inferiority is, however, less pronounced among the village children, 
who had better schooling facilities. Both groups show a fairly con- 
sistent age decrement, which is also Ics.. marked in the village group. In 
the case of the mountain children, the mean IQ’s tended to be lower 
on the verbal than on the non-verbal and performance tests. 

For dcsciiplive material regarding these people and their sin roundings, the 
reader may examine the accounts by Campbell (9), Kephart t46), and Raine (74), 
as well as the more recent icport by Lewis (^3). 
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TABLE 64 

to Age 


Mean IQ of Mountain and Village Children in Relation 


(Frtmi Slit'ninn and Key, 82, p. 287) 


Pintncr- 

Pintncr- National Goodcnniigh Paterson 

CimnittfiJiam IntcUii’ence Draw-a- Performance 


Age 

Test 

Test 

Man Test 

Tests 

Ml. 

Vill. 

Ml 

VJll. 

Mt. 

Vill. 

Mt. 

Vill. 

6- 8 

84 

94 




93 



8-10 

70 

91 


117 

66 

82 

76 

93 

10-12 

53 

76 

66 

101 

71 

69 

70 

87 

12-14 



67 

91 

69 

73 

83 


14-16 



52 

87 

49 

70 

73 



Of special interest is Ihc study conducted by Wheeler (99, 100) 
on East Tennessee mountain children. Group intelligence tests were 
administered in 1940 to over 3000 children in forty mountain schotds. 
The results were compared with those obtained on children in the 
same areas and largely from the same families, who had been simi- 
larly tested in 1930. During the intervening ten-year pffriod, the eco- 
nomic, social, and educational status of these sections is reported to 
have improved considerably. Paralleling such environmental improve- 
ments, a rise in IQ from the first to the second sampling was noted 
at all ages and all grades. The median IQ’s were 82 and 93 in the 
1930 and 1940 samplings, respectively. The usual age decrement was 
found m both samplings. In the 1930 group, the median IQ dropped 
from 94.7 at age 6 to 73.5 at age 16; in the 1940 sample, it dropped 
from 102.6 at age 6 to 81.3 at age 15. 

It might be added that such an age decrement is not limited to the 
relatively unusual groups of children discussed in the present section, 
but has also been reported for other underprivileged groups. This 
decrement is especially apparent where educational facilities arc de- 
ficient. Among the groups for which such a drop in IQ with age has 
been found may be mentioned: southern mill-town children of low 
socio-economic level (45); children reared in a “high delinquency 
area” in a large city (54); and children admitted to an orphanage 
after varying periods of residence in their own, ver> inferior homes 
(86). Such age decrements have also been noted in many investiga- 
tions on rural children, to be discussed in the following section. 
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INTELI IGENCE TEST SURVEYS OF RURAL CHILDREN 

The distinction between urban and rural populations is partly one 
of occupation, but it also involves other important aspects of the 
physical and social environment. Most of the dilTcrcnccs are such 
as to handicap the rural child in academic progress and in the type 
of abilities sampled by most intelligence tests. 'Thus educational op- 
portunities are notoriously poor in many rural districts of oui coun- 
try, in sharp contrast to the excellent facilities a\ai!ablc in most towns 
and cities. The length of the school term is often shortened in rural 
communities because of the impa.ssable condition of the roads at 
certain times of the >ear, or because the children are needed to help 
with farm duties in busy seasons, or for other reasons of a local 
nature. In some cases tlie school term lasts only six months. Similarly, 
the difference in t\pc and amount of instTuction leceived in the '‘con- 
solidated” and the “one-room" school is a very real one. In the latter 
type of school, in which pupils of all ages and grades are taught 
by a single teacher and in a single classroom, progress must neces- 
sarily be very hailing. Differences in the provision of books and other 
supplies, as well as in teacher training, arc too obvious to mention. 

The general cultural miheu of dillerent localities likewise presents 
striking contr<ists. Libraries, museums, and other facilities for the 
intellectual or artistic stimulation of the community arc far more 
accessible and better developed in urban than in rural districts. The 
recreational activities of rural children are quite different from those 
of urban children, as show , for example, in the extensive survey of 
play activities conducted by Lehman and Witty (32). These investi- 
gators concluded that the difference^ are “directly traceable to envi- 
ronmental opportunities,” and that ^aich differences mav in turn in- 
fluence the direction of the child’s intcIJcctual development. The 
extent and variety of social contacts also differentiate city and country 
groups. Between the cosmopolitan associations of the large metrop- 
olis, with its kaleidoscopic array of diverse customs, manners, and 
peoples, and the relatively homogcrcous and sparse contacts of the 
rural village or open country, there exist tremendous differences in 
social stimulation. 

The fact that country children as a group score distinctly below the 
city norms on current intelligence tests has been repeatedly demon- 
strated. Numerous investigations, some employing several thousand 
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chiWren and covering practically complete school populations, have 
consistently revealed the inferior performance of rural children in all 
parts of the United Slates. Typical results are given in Table 65, 
based upon McNemar’s analyses of the standardization sample of the 
1937 Stanford-Binet (66). The investigators made a very serious 
effort to obtain urban and rural samplings which were representative 
of the country at large, but they express a certain amount of skep- 
ticism regarding the adequacy of the rural sampling. They point out, 
however, that the selective factors wer*^ such as to reduce the urban- 
rural differences in their data. Thus the differences between city and 
country children would probably have been still larger if a more 
representative group of rural children had been surveyed. 


TABLE 65 Mean Slanjorci-Hinet IQ's of Urban, Suburban, and 
Rural Children 

(I rom Me \cniar, O), jt .’7) 


Locality 



yiyr Ranye in Years 

h 14 

15-J8 


N 

Mean 

N 

Mean 

N 

♦ 

Mean 

Urban 

354 

100.3 

864 

105.8 

204 

107.9 

Suburban 

158 

105.0 

531 

104.5 

112 

106.9 

Rural 

144 

100.6 

422 

95.4 

103 

-95.7 


Separate means for urban and suburban groups arc included in 
d'able 65, but as would be expected, these groups did not differ 
appreciably. Suburban communities are within commuting distance 
of large cities, and they share most of the benefits of urban centers. 
The rural children, on the other hand, average about 10 IQ points 
lower than the urban during school age (6-18), and about 5 points 
lower during the preschool period (2-514). It is noteworthy, too, 
that the investigators report a slight tendency for rural IQ’s to drop 
at the beginning of the school period, no such tendency having been 
found among urban children (91 ). 

That age is an important factor in the amount of urban-rural dif- 
ference was also demonstrated in the thorough and comprehensive 
investigation conducted by Baldwin, Fillmore, and Hadley (3) on 
Iowa farm children. Children in four rural communities were com- 


For a good summary of studies conducted prior to 1930, cf. Shimberg (83). 




817 


Socio-Economic Differences 

pared with Iowa City children, as well as with the test norms. The 
number of rural subjects tested at each age level, together with the 
tests employed, arc shown below: 

123 infijils (4 to 40 iponths) Iowa Baby Tests 

163 pjeschool chilJicn (3 (o 6 years).. . Ociioit Kindergaften Test 

871 school children Stanfoid Binel 

Oils Inlelligence l est 
Pinliier-I'citerson (.S selected tests) 

The results showed that, among the rural infanls, there was no 
noticeable inlcrionty on the baby tests. Nor can this lack of dilTeren- 
tiation be attributed a deticicncy in the discriminative power of 
the tests, since wide individual dhleiences were obtained. In the 
preschool group, a rural inferiority appears in the 5- and 6-year-oid 
groups, no signiheaiit dilTerenccs having been found at tb.e voungcr 
ages. I'he rural Sih .ol chiidicn. hov\cvcr, showed a dcliii’/e 'mlellec- 
tual retardation whiih became increasingly large as they progressed 
through school. 

4 his delicieney, futhennorc, was more pronounced in the one-ioom 
than in the consolidated schools. In the latter type ol school, the intel- 
lectually retarded children were found chiefly at the upper ages; 
whereas in the one-room schools, the median mental age was alw'ays 
lower than the median chrunological age. The niediam mental age 
deficit in the onc-room schools ranged from 1 to 6 months up to the 
age of between the ages of 9 and 12 it increased ironi 7 to 14 
months; and at ages 13 and 14 it amounted to 16 and 39 months, 
respectively. In the coii- didated schools, on the other hand, the 
median mental age exceeded the median chronological ago up to the 
age of 13, the excess hrr each age anging from 1 to 8 months. From 
13 to 18 years, llie mediian mei'‘.t: I age was lower than the median 
chronological age, the deficit in successive years being 2, 6, 19, 10, 
11, and 10 months, respectively. The drop in amount of retardation 
beyond age 14 may be due to the more select nature of the older 
groups in rural schools. 

An analysis of the rural ehildrcn'^^ performance on different tests 
or parts of tests revealed their handicap on verbal materials. In regard 
to the performance tests, it is interesting to note that the farm chil- 
dren excelled on the Marc-and-Foal Test, a picture completion test 
portraying a farm scene. On all other performance tests which in- 
volved speed, the rural subjects were deficient, their movements tend- 
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ing to be slow and deliberate. The usual instructions to work rapidly 
did not in themselves seem to provide sufficient incentive for these 
children. The rate of movement could, however, be increased if other 
appeals were added. The investigators suggested that “the children’s 
apparent lack of comprehension of the meaning of hurry is to be 
expected as a consequence of some of the inOucnces that surround 
them” (3, p. 254). 

In summary, these studies, together with many other similar sur- 
veys, show rural children to be consistently inferior on both intelli- 
gence tests and educational achievement tests. This inferiority tends 
to be greater on verbal than on non-verbal tests, and greater on tests 
which emphasize speed. With increasing age, rural scores tend to 
decline in relation to urban norms. Rural inferiority is also more 
marked on group than on indi\idual tests. It has been suggested that 
the country child's perlormancc on a group scale may be handi- 
capped by his greater shyness with strangers (71). This difficulty 
would be partly overcome by the examiner’s efforts to establish rap- 
port in the administration ol an individual scale. 

Average scores also tend to be lower in those districts with poorer 
schooling facilities. Ihis is most clearly apparent in tlTc comparison 
of one-room with consolidated scheuds. A somewhat diflcrent analysis 
of rural areas was employed by Pressey and Thomas, in their 
study of Indiana farm c’aldrcn (73). When school children were 
classified into those living in “poor" farming districts, where the land 
was hilly and the soil inferior, and those living on ‘"good” farm land, 
the two groups were found to differ significantly in intelligence test 
performance. Among the children in the good farming districts, 36% 
reached or exceeded the median of city children, as contrasted to 
only 20% in the poor farming districts. When a younger group (ages 
6 to 8 years) was tested in the poorer farming area, 22% of the 
children reached or exceeded the urban median (71 ). In explanation 
of these findings, the authors suggest that a constant selective process 
goes on in farming areas, the inferior, less intelligent families being 
pushed back into the hill country where the soil is poorest. It should 
also be noted, however, that in the poorer farming area studied by 
Pressey and Thomas, educational facilities were notoriously deficient 
and socio-economic level of the homes was lower than in the good 
farming area. 
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There is a considerable body of data from several European coun- 
tries demonstrating the existence of urban-rural differences in intelli- 
gence test performance. Thus Klineberg (48), in the investigation 
discussed in the preceding chapter, found that the urban-rural dif- 
ferences in performance on the Pintner- Paterson tests were much 
larger than either racial or national differences. The relevant data 
arc summarized in Table 66. The mean difference between the entire 


TABLE 66 Performance Test Scores of Urban and Rural Groups in 
Europe 



( 1' t (>in KImi' 

iitnj, 48, ji 

.’7 1 


Croup 

Ninnhrr 
of ('a\cs 

Mean 

Median 

Ran^e 

Paris 

100 

219.0 

218.9 

^ 00- 302 

Hamburg 

100 

216.4 

218.3 

105-322 

Rome 

100 

211.8 

213.6 

109-313 

Total city 

300 

215 7 

216 9 

100-322 

'Potal country 

700 

187.1 

J87.0 

63-314 


urban sample of 300 children and the entire rural sample of 700 is 
over eight times as large as its standard error, and is hence clearly 
significant. In terms of overlapping, only 3(). 12% of the rural chil- 
dren reached or exceeded the median of the urban children. It is 
also interesting to note that the three city groups, tested in Paris, 
Hamluirg, and Rome, dixered little among theinselvrs. None of the 
differences between these thicc city means is statistically significant. 
The rural groups, it will be rccalle i. revealed larger and fairly signifi- 
cant national dilTcrences. It would seem that the equalizing effect of 
life in a large cosmopolitan city tends to obliterate many of the dif- 
ferences arising from the specific national culture. Equally striking 
iirban-rurtil ditlcrenccs were found by Rosea (77) in Rumania. On 
a .series of locally con.^tructed intelligence tests given to 2032 chil- 
dren, the mean lb of the urban ch’Micn was 107, that of the rural 
groups, 86. 

A number of invcstiga.tions conducted in Great Britain have shown 
much less urban-rural differentiation in intelligence test performance 
than has been found in America or in other European countries. 
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Especially is this true of the more remote rural districts of Great 
Britain, which frequently show no inferiority to the urban areas 
(5, 90). A particularly clear demonstration of this point is provided 
by the results of one of the Scottish Surveys, in which the Stanford- 
Binet was given to a complete sampling of all children born in Scot- 
land on June 1, 1921 (cf. 78). The relative performance of urban 
and rural groups in this survey w^as closely corroborated by a more 
extensive survey with group tests. The Stanford-Binct results are 
shown in Table 67. Mean IQ’s are reported for the four cities, the 


TABLE 67 Mean IQ's of Children In Urban and Rural Areas of 
Scotland 


(I'umi Knsk, 78, p. 27 


Ann 

NiiinfH't 

(>} ( USl V 

Ml an JQ 

SD 

The Four Cities 

.319 

100.86 

1529 

Industrial Belt 

393 

99.19 

16.18 

I'nlire Rural Area 

162 

100.92 

14.52 

Highlands and Islands 

47 

101.79 

13.J3 


industrial belt, the entire rural area, and a subdivision of the rural 
area comprising the Highlands and the Islands, which represent the 
more isolated rural districts. Not only are there no significant dif- 
Icrcnces between the meari IQ’s of any of these groups, but also the 
highest mean and the smallest variability are found in the Highlands 
and Islands. In partial explanation of such findings, Rusk, the director 
of these surveys, observes that “perhaps novvliere has scholastic oppor- 
tunity been more evenly equated than in Scotland; 99.7% of Scottish 
teachers are fully trained” (78, p. 273). 

It may also be pointed out in this connection that rural living is 
relatively more desirable and enjoys greater prestige in the British 
culture than in many other countries. In explanation of the finding 
that remote rural areas in the British Isles sometimes rate higher on 
intelligence tests than do those rural areas which arc more accessible 
to cities, it has been suggested that selective migration may have 

'"’A •Nimdar siipciioniv of ihc nioie remote iiiial aieas has been found by Jones, 
Conrad, and fUanchaid (44) in our own New Hngland states. It is iioleworlhy that 
in New Cngland educational facil tics aie probably betrer, and mo c nearly iin form 
Irnm city to couniry, than in other sections of the IJiuicd Slates. 
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drained the more intelligent persons from the latter areas, but is less 
likely to have alTected the remote rural areas (90). The question of 
selective migration will be considered in the following section. 

SFLECTIVn MlCiRATION IN RELATION TO URBAN-RURAL 

Dll FERENCLS 

Migrations between city and country arc constantly occurring for a 
variety of reasons. During a pciiod of settlement and development, 
migration occurs predominantly from the urban to the rural districts. 
The westward expansion of the United States is an example of such 
a movement. The tide of migration soon turns, however, and the farm 
dweller is attracted to the city with its promise of wider vocational 
opportunities and other facilities. At any time, howevei, such eco- 
nomic events as the ('opening of mines, the discovery of o\\ or gold, 
and to a lesser extent, the construction of roads or the establishment 
of railway connections will bring about a sudden intliix into a pre- 
\’iously isolated area. These niiwcmcnts of population, cither en masse 
or by single individuals, depend upon a complex manifold of eco- 
nomic, political, social, and psychological factors. 

It has been repeatedly argued that the observed intellectual in- 
feriority of rural groups results primarily from selective migration 
rather from environmental handicap. According to this theory, the 
more intelligent, progressive, and energetic families or individuals arc 
attracted to urban centers, while the duller and less ambitious remain 
in the country. The opei, :ion of such a selective process for several 
generations would eventually lead to an inferior rural stock. It is 
probtibly true that in certain loca’ities migration may have drained 
the country of its best families, b '1 this cannot be offered as a uni- 
versally applicable conclusion. The opposite argument could just as 
readily be put forth in certain situations, i.c.. that it is the shiftless 
and the dull who migrate because they have been unable to succeed 
at home. The forces of selection arc loo dilhcult to disentangle, unless 
the specific history and conditions of the district under consideration 
arc known. No single generalization can be applied to all migrations. 

The most direct test of the selective migration hypothesis is through 
a study of the migrants themselves, vSuch a procedure was followed 
in a number of studies by Klineberg (49, 50). In one of these, 12- 
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year-old Negro school boys in three large southern cities were given 
the National Intelligence Test (49).^® Those who had migrated to 
these cities with their families were classified according to length of 
residence in the urban environment. A city-born group was also 
tested for comparative purposes. As can be seen from Table 68, a 
definite improvement in mean National Intelligence Test scores was 

TABLE 68 Mean National Intelligence Test Scores of 
Southern Ne^ro School Boys in Relation to Length 
of Urban Residence 


( I'loni Klimhcrj^, J> "^4) 


) CLi! V of 

L'lhan Rcsulence 

Nunihrr 
of CoU’K 

Mpuii Score 

One 

39 

38.3 

Two 

25 

43.2 

Three 

36 

44.7 

Four 

47 

62.5 

Five 

52 

56.2 

Six 

53 

62.2 

Seven and more 

165 

68.7 

C ity-born 

359 

74.6 


found with increasing length of urban residence. The difference is par- 
ticularly striking if we compare those who had lived in the city for 
only one year with those who had lived in it for seven years or more. 
The city-born children, who had been exposed to the urban environ- 
ment for twelve years, received a still higher mean score. These in- 
tellectual differences among the various residence groups may be 
attributed not only to the varying amounts of time which the subject 
had spent in a more favorable environment, but also to the age at 
which such environmental influences operated. Thus since all subjects 
were 12 years of age, those in the one-year residence group had not 
been exposed to the urban environment until the age of 11, when 
they were rclati\'cly more "‘immune” to the effects of environmental 
changes. The migrant group with the longest urban residence, on the 
other hand, had moved to the city at the age of 5 or younger. It might 
be added that there was no apparent basis in economic, social, or 

^‘^This investigation was part of the general study of selective migration among 
Negroes which was reported in Chapter 22. In the data now under consideration, 
however, the problem of noithern and southern Negroes does not enter, since all the 
migrations occurred from rural to urban areas within the southern states. 
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other conditions for any progressive decline in the quality of the 
migrating population during the period under investigation. Hence 
the obtained dilTcrences in ability between the different residence 
groups are not likely to have resulted from temporal changes in the 
nature of the rural groups which migrated to the city. 

Another study by Klinebcrg (50) dealt with white migrants from 
rural New Jersey to urban centers in the same general area. It was 
possible to examine the records of 597 migrant children who had 
taken intelligence tests in rural schools prior to their urban migra-* 
lion. These children were found to average slightly below the non- 
migrants in the same rural schools. The results of both of these 
studies suggest that the migrating populations did not represent an 
initially superior selection, but that they gradually improved after 
moving to the superior urban environment. 

It should be noted that both of these studies were concerned with 
children^ who did not themselves initiate the migration but simply 
moved with their families. Somewhat different results have been re- 
ported in studies on adult mii'rams. In such cases, the individuals 
studied are usually the ones who made the decision to migrate. In this 
respect, these studies might be said to be more direct. At the same 
time, it might be noted that from the viewpoint of long-range effects 
over a period of several generations, the data on the children of 
migrants are actually more relevant. 

In general, the studies on adult migrants do show a tendency for 
.the migrants from rural to urban areas to constitute a superior sam- 
pling of the rural populat m (27, 62, 80). In the most extensive of 
these studies. Gist and Clark (27) followed up a sample of 2544 
high school students in forty n ral communities in Kansas. All 
these students had taken the Tern; in Group Tests of Intelligence in 
1922-23, when their median age was 16 years. In 1935, when the 
median age of the group was 29, the investigators obtained data on 
the residence of these former students. Over 70% of the original 
group were found to have migrated, the investigators pointing out 
that the proportion would be even fjeater if they had included those 
who had left the area and could not be located. Of those actually 
found, 37.89% were living in urban communities, 32.19% had moved 
to other rural areas, and 29.92% had remained in the original com- 
munities. Several comparisons among various migrating and non- 
migrating groups revealed statistically significant differences in initial 
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IQ. Tlius the migrants to urban centers excelled both the non-migrants 
and the migrants to other rural areas; those who had moved to larger, 
cosmopolitan centers surpassed those who had moved to smaller 
cities; those who bad left the stale averaged significantly higher than 
those who had remained in Kansas; and rural non-farmers scored sig- 
nificantly higher than farmers. 

Two points should be borne in mind in interpreting these results. 
First, since the original sampling consisted of high school students, no 
information is provided regarding the L^wer levels of the population. 
There is some evidence to suggest that migrants may be draw^n from 
the extremes of the distribution (26, 102). Thus among moderately 
successful persons, it may well be that the more alert, ambitious, and 
intelligent are attracted by the superior opportunities olTcred by the 
cities. But among those who are eking out a bare existence close to 
a subsistence margin, it may be the more hopeless and destitute who 
are more likely to migrate. A second point is that selective migra- 
tion docs not imply a hereditary interpretation of urban-rural dif- 
ferences. if it should be demonstrated conclusively that the superior 
families tend to migrate to cities, such families may be superior be- 
cause of environmental factors within their original lirurroundings, 
and their offspring may in turn be superior because they are reared 
in a relatively favorable family milieud" 

SPI ( IFir iTY OF [NTFI FI CTirAL DlFFFRl NCFS 

AMONG SlK lO-FC'ONOMIC GROUPS 

There is a growing tendency to envisage group differences in 
terms of specific abilities rather than in terms of general intellectual 
inferiority or superiority. The application of this concept to racial 
and national comparisons has already been discussed (cf. Ch. 21). 
In that connection, it w^as pointed out that each culture ''selects” and 
stimulates certain abilities, skills, and fields of knowledge as the most 
significant, t hrough the fostering of certain talents, specific patterns 
of psychological development may be produced within each culture. 
Under such conditions, any attempts to evaluate the mentality of one 
culture in terms of another w'ould be misleading and would tend to 
give a decided advantage to the group within which tJie measuring 
instrument was standardized. The same may be true of urban-rural 

discussion of similar point in Chapter 22. 
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comparisons. Intelligence tests have been standardized predominantly 
on city children, because of the greater accessibility of the latter in 
large numbers. Consequently, such tests may be overweighted with 
items which favor the oily child, and may fail to sample adequately 
those abilities in which the rural child excels. 

There is a certain amount of evidence which supports such an 
interpretation of nrhan-j mal differences in intelligence test perforni- 
ancc. Rural children arc neither uniformly nor consistently inferior 
to urban children on all tests. Thus the city child may excel on the 
conventional tests of abstract intelligence, but the eoiintr) child may 
excel on tests of mechanical or musical aptitude (cf., c.g., 81), 
Moreover, an analysis of perfornuince on dilTerent in/ellic^enee test 
items suggests that the relative dilliculty of individual items may vary 
considerably for urban and rural populations. This was clearly brought 
out in a study by iemes, Conrad, and Blancluird (44). 4he subjects 
were 351 children between the ages of 4 and 14, all living in rural 
areas of Massaeliusetts and Vermont. The IO"s ol these children on 
the 1916 Stanford-Hinet were consistently inferior to the test norms, 
which had been obtained on a predominantly urban sampling. 

An examination of the peiformance of the rural children on each 
test of the Stanford-Binet scale, however, showed that tliese children 
V ere significantly inferior on only six tests. Their inferiority on other 
tests, although consistent from age to age, v\as statistically insignificant. 
The tests which yielded tlie largest degree of nirad inferiority were: 
those involving the use of paper and pencil, as in copying a square; 
tliosc depending upon s[ 'dlie experiences more ccmimon In an urban 
environment, such as familiarity with coins, street-cars, etc.; and dis- 
tinctly verbal tests, such as vocabulary and the definition of abstiact 
terms. In all these cases, the sp^’i ifie envin^mnentdl Juindkap of the 
rural child is apparent. Similarly, "'growth curves'’ for individual 
tests showed the greatest age divergence betw'cen urban and rural 
groups in such tests as vocabulary, dissected sentences, naming sixty 
words, and wa^ird definitions. A diminisiiing urban-rural dilTerence 
with age, on the other hand, was ^ound in such tests as ball-and-ficld, 

^^Thc inllucnce of cnviionmcnl:il 4lil.\’icnres is tiirther Ucnionstrated by ihc 
subjects’ pciloimancc on the four Pintncr-Palcrson lesis administered in this study 
On the Maic and hoal, the nnat chiidien surpassed the norms, their mean tQ being 
110 (cf. similar lesiilts ubtaincd with (his le^t by Baldwin, Fillmore, and lladlcy, as 
reported on p SH"! On the fUvc-i'i;uue Form Board and the Knox Cube, involving 
more absiract niatei rds, the\ were sl-ghtly tnfciior; unci in Digit-Symbol Substitution, 
a paper-and-peneil test, thev weie verv inferior. 
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giving the number of fingers on the two hands, counting thirteen 
pennies, and other predominantly non-verbal tests. 

Tlie investigators also demonstrated that the relative difliculty of 
individual tests in the Stanford-Binct, as determined by the per cent 
of children passing each test, may differ for urban and rural groups. 
For the rural children in their sampling, tests located within a single 
year level were often quite dissimilar in difficulty. Jn fact, the range 
of difficulty within a single year level was sometimes greater than the 
difference between successive year levels 

Such results suggest that the selection and placement of test items 
might be quite different if a test were standardized on rural rather 
than on urban samplings. A direct attack upon this question is to be 
found in a study by Shimberg (83). The basic plan of this investiga- 
tion was to standardize one lest on city children and a second, parallel 
test on country children. Both tests were then administered to both 
urban and rural groups, and the performance of the two groups on 
each test was compared. The particular test selected for this purpose 
was an information test. 1 his choice was justified on the grounds that, 
in the first place, such tests are frequently included in intelligence 
scales. Secondly, even in scmIcs which do not contain a#>cparate in- 
formation test, specific items of information are required in nearly 
all other tests. Thus a picture completion test, for example, implies 
the possession of information regaiding the characteristic appearance 
and function of presumably familiar objects. 

Each form of the information test consisted of 25 questions. The 
tests were “scaled,” i.c,, the questions were arranged in order of dif- 
ficulty and represented approximately equal increments of difficulty 
from the easiest to the hardest. This was accomplished by giving a 
large number of questions to the standardization groups and tabulat- 
ing the percentage of children who answered each correctly. From 
these percentages, the difficulty value of each question was computed 
in terms of a-unils. In the final step, the 25 questions which were 
most evenly spaced in difficulty value were selected for inclusion in 
the scaled test. This procedure was followed with 764 urban children 
for Information Test A and with 416 rural children for Information 
Test B. It should be noted that no question dealing with items of 
purely local knowledge was included in either form. Both forms were 
“fair” to city and country children in the sense that the subjects in 
either group had some opportunity to acquire the requisite informa- 
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tion. There were, in fact, a number of items common to the two 
forms. In the original scries from which the scaled items were selected, 
37 questions were identical in forms A and B. 

Both scaled tests were administered to two new groups of urban 
and rural children. The number of subjects employed in this part of 
the study was distributed as follows: 

Vi ban Rural 

Form A 6477 610 

Form B 9o2 4875 

The mean scores of urban and rural samplings on forms A and B of 
the test are shown in Figures JOO and 101, respectively. 




Fig. 101. Average Scores of 
Urban and Rural Children on 
Information Test B, Scaled on 
a Rural vSampling. (Data fiom 
Siumberg, 83, p. 50.) 


Fig. 100. Average Scores of Urban and Rural Chil- 
dren on Information Te^^t A, Scaled on an Urban 
Sampling. (Data Irom Shimberg, 83, p. 45.) 

I'^Thc'scores on Tests A and B are not expressed In the same icims. The foriner 
are IransmiUed T-scoic. w.ih a mean cf 50; ihe laticr r-'pteseni the acli.al number 
of correct items out of 2*'. 



828 Differential Psychology 

It will be noted that on Test A, which was constructed and scaled 
on city children, the urban groups excel (Figure 100). Among fourth 
grade children, this difference is 6.5 times as large as its standard 
error, and among fifth-graders 4.5 times as large. In the upper grades, 
the critical ratios arc under 3, and at the eighth grade the difference 
is reversed, indicating a very slight and insignificant superiority of the 
rural group. This reversal is attributed by Shimberg to the differential 
operation of selective factors in urban and rural groups at the upper 
school grades. Rural children as a whole, and one-room school pupils 
in particular, tend to be more retarded educationally than urban 
groups. Consequently, a large percentage of the duller children have 
left school before reaching the upper grades, and those who remain 
are a relatively select group. It might be added in confirmation of 
this explanation that age comparisons, from 9 to 16, revealed a con- 
sistent superiority of the urban groups on Test A. In terms of age, 
the rural children were approximately one year retarded on this test. 

On Information lest B the .situation is entirely reversed (Figure 
101 ). The rural group is now consistently superioi, the differences 
in its favor being completely significant at each grade. 1 he critical ra- 
tios of tliese differences arc all over 3, ranging from 5.56 let 9,33. \ hus 
the hypothesis which this investigation undertook to test seems to 
have been completely verified. The urban group excelled on the test 
constructed on city children., the rural group on that construded on 
country children. Either group might be ranked ‘Superior,” depend- 
ing upon the specific test employed. 

More recent investigations have provided preliminary data which 
suggest that a similar specificity of intellectual differences may char- 
acterize the abilities of different social classes. Such a finding was 
already suggCNted in an earlier section by the comparison of children 
of high and low socio-economic levels on different typies of intelli- 
gence tests. It will be recalled that the inferiority of the lower status 
classes tended to be greater on the more highly verbal tests. The 
analysis of intelligence test items shows even more marked specificity. 
This was illustrated in a comparison of the Stanford-Binet perform- 
ance of 140 first grade children of low socio-economic level with that 
of 114 first grade children of high socio-economic level (79). The 
former group did significantly better than the latter on tests involving 
counting, the handling of money, and sensory discrimination; the 
latter excelled significantly on tests involving vocabulary, verbal com- 
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position, rote memory, naming similarities and dilTercnces, and motor 
control. The Sianford-Binct as a whole also showed a larger differ- 
ence between these two groups than did the Goodenough Draw-a- 
Man Test. 

Of special interest in this connection is the research undertaken by 
Haggard, Davis, and Havighurst (IS, 33) on "cultural differentials” 
in intelligence test items. The first object of this project was to meas- 
ure the relative success of children of different socio-economic levels 
on the individual items of eight widely used group tests of intelli- 
gence. The data were obtained by administering these tests to all 
children of ages 9, K), 13, and 14 in “Midwest,” the same community 
investigated in the studies by Davis and Havighurst which were dis- 
cussed in an earlier section. The tests were found to vary widely in 
the proportion of their items which fa\'orcd children oi high socio- 
economic status. V.'ittiin any one test, wide differences in this respect 
were also found from one item to another. For example, an item 
based upon an understanding of the term “sonata” was passed by 
74.2% of children in a high socio-economic group and only 28.5% 
of those in a low socio-cjonc)mic gicmp. whereas an item involving 
the classiiicalion ol nulling t<H>5s was passed by 76% of the high and 
79% ol the low socio-economic group. 

These differences in the proportion of children of high and low 
socio-economic status who pass an item arc what the investigators 
mean by the “cultural differential” of the item. They maintain that 
items with high cultural differentials should be eliminated from intel- 
ligence tests, in much i e same way that items which favor cither 
sex to a marked degree are now commonly eliminated. Whether or 
not such items should be includ* d in a psychological test depends, 
of course, upon the purpose for v hich the test is designed. If it is our 
aim to study the wins in which social status classes differ, then it is 
just the items which differentiate maximally between such classes that 
arc of most interest. 

In an effort to analyze further the basis of cultural differentials 
in intelligence test items, the same investigators (18, 33) set up a 
carefully controlled experiment winch they conducted on 656 11- 
to 12-ycar-old children. The children were divided into a high and 
a low socio-economic group, the two groups being matched in age, 
school grade, and IQ. The effects of the following factors were in- 
vestigated: oral versus written presentation of items; the interpolation 
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of two hours of instruction and practice with problems similar to 
those on the tests; motivation, in the form of a promised movie ticket, 
for good work during the practice period or during the test proper; 
and the substitution of test items similar to those in the original test, 
but rendered ‘"culturally fairer” in their specific content. The lower- 
class children tended to profit more than the upper-class from the 
last-mentioned revision of items, although both groups did somewhat 
better on the “culturally fairer” forms of the tests. Lower-class chil- 
dren also gained relatively more from oral presentation and from 
added motivation during the test proper. 

It is apparent from all these studies that urban-rural, occupational, 
and other socio-economic groups dilTer in specific ways. Hence any 
statements regarding intellectual “inferiority” or “superiority” of such 
groups need to be qualified fully as much as do comparisons between 
the broader cultural groupings discussed in earlier chapters. Similar 
qualifications would of course apply to any generalizations regarding 
emotional adjustment or other personality characteristics, in which 
group differences are likewise rather specific. 
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24 


7 ha TuciiviAiial as a Mcmher 
of Multi p la (Jroitps 

Differential psycitoi ociY, in its broadest sense, is eoncerned with 
all variations in behavior phenomena among Individuals and among 
groups. The observation and measurement of sueli dihciences have 
led to the accumulation of a va.st body of descriptive material which 
has proved scicnfinccdly interesting and practically useful. F.xamplcs 
of such material have been given throughout the present book. The 
fundamental aim of dilTcrential psychology is not, however, the col- 
lection of descriptive material. Its aim is similar to that of all psychol- 
ogy, viz., the undcrsiiutdui^ of hcduivior. DilTcrcntial psychology 
approaches this problem through a comparative analysis of behavior 
under varying einironmcntal and biological conditions. Ry relating 
the observed dilTerenees in behavior to other known concomitant 
phenomena, it should be possible to tease out the relative contribu- 
tion of dilfcrenl factors to behavioral devclopmerh. If we can deter- 
mine why one person reacts differently from another, we shall know 
what makes people react a'' dic}^ do. 

The unit of dilTercntial psycliology is the individual, conceived as 
a reacting organism; our interest in groups is only secondary. Many 
traditi(>nal groupings, furthermore, Jiave proved to be arbitrary and 
ill-defined. From the standpoint of behavioral development, the effec- 
tive groupings are stimiilational and iK^t biological. It is not the race, 
or sex, or physical ‘d vpe’" lo which the indi\adual belongs by heredity 
that determines his psychological make-up, but the cultural group in 
which he was reared, the traditions, attitudes, and points of view 
impressed up(m him, and the type o. abilities fostered and encour- 
aged. Even when behavioral differences are found to be associated 
with phy.'.ically defined groups, it is usually the indirect social effects 
of such groupings, ratlicr than their biological characteristics, which 
influence behavior development. 
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Since all types of behavior are influenced by the subject’s stimula- 
tional background, it follows that psychological data obtained within 
any one cultural group cannot be generalized to cover all human 
behavior. Many statements offered under the heading of general 
psychology arc not general at all, but are based upon human behavior 
as it develops within a single culture (cf. 11, 26, 49). This limitation 
has sometimes been described as a “community-centrism” which per- 
vades much of our psychological information (49), It has been 
suggested (26, p. 256) that many textbooks of “general psychology” 
might be more accurately characterized as dealing with “the psychol- 
ogy of Americans and Western Europeans of the late eighteenth and 
early nineteenth centuries.” In a somewhat similar vein, Dollard 
(10, p. 17) stresses the importance of considering the cultural setting 
in which behavioral observations are made. He ventures to suggest 
that “to the social psychologist, the three most indispensable letters 
in the alphabet are T.O.C. (in our culture),” and points out that these 
qualifying letters should be regarded as implicit in all descriptions 
of behavior within our cultural setting. Such cultural restrictions un- 
doubtedly apply to much of the descriptive and facttial content of 
psychology. This does not, however, preclude the possibility that 
when the specific behavior is studied against the individuars stimu- 
lational background, the same principles of behavior will be found 
to operate (cf, 12, 14). Siich a study of group and individual dif- 
ferences in behavior should, in fact, help to clarify the common 
underlying principles of behavior development. 

C ULTURAL LRAMI S OF RFFLRENCF IN BHIIAVIOR 

The observations of anthropologists in various cultures provide in- 
numerable illustrations of the influence of cultural “frames of ref- 
erence” upon behavior.’ What is often regarded as a “natural” 
response to a particular stimulus may be “natural” only because of 
the social norms and standards which we have acquired in our own 
cultural selling. Our very conception of the world about us is influ- 
enced by our own specific rcactional history. A purely “impartial” 
or “objective” observer is a psychological impossibility. Each indi- 
vidual’s observation and description of any fact is conditioned by his 

^ For a fuller discussion of tins point and many additional illustrations, cf. K’ine- 
berg (26, 27) and Slicnf (49. 50). 
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special past experiences as well as by the more general traditions and 
customs inculcated by his group.- 

Even the simplest perceptual responses show evidence of such a 
cultural framework. Whether we perceive an object as light or heavy, 
long or short, hot or cold, pleasant or unpleasant may depend in 
part upon our previous, socially determined experiences. An interest- 
ing example is provided by the perception of family resemblances 
and differences in a number of primitive cultures. Malinowski (32) 
reports, for example, that among the Trobrianders, resemblance to 
the father is considered natural and proper, whereas the child is never 
said to resemble the mother or any of the maternal relatives. The 
existence of the latter t>pes of resemblance is vigorously disclaimed. 
Resemblances between brothers are likewise denied, although the 
resemblance of each brother to the father is granted! It is, of course, 
difficult to determine to what extent these reactions repicsent a re- 
fusal to admit the proscribed resemblance, and to what extent they 
indicate a failure to perceive the similarities of appearance. The 
results of many experiments on the effects of expectation and “set” 
upon perception, however, make it appear entirely plausible that the 
Trobrianders only notice those familial resemblances which have 
been institutionalized by their culture. 

Tt is well known that preferences for tastes and odors, as well as 
likes and dislikes for foods, vary widely from one culture to another. 
Among certain African tribes, cologne and scented soap evoked 
loathing and disgust (cf. 27, p. 209 ). On the other hand, odors which 
we find very unpleasant h ive at other times or places been used as 
perfumes. 

Popular conceptions of time and space, although commonly taken 
for granted, can readily be shown U, be cultur.illy determined (20, 27, 
49, 53). Even the concern with precise estimation of the time of 
occurrence and duration of events, so characteristic of our culture, is 
quite lacking in others. The indifference to time found among many 
primitive groups is illustrated by individuals’ lack of knowledge about 
their own age, and by their inabi'hv to indicate how long ago an 
event occurred if a period of several years has intervened. Other 
differences are to be found in the way in which time is reckoned. 

2 From this point of view, one may rej-^ard the instiuments and techniques of 
science as a means of reducing or minimi/ing the effect of the observer’s idiosyn- 
crasies. 
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The use of aslronomicaJ events as a framework for the measurement 
of time is by no means universal, many other familiar and recurrent 
events serving for this purpose among different peoples. Thus in 
Madagascar the natives refer to “a rice-cooking” when they wish to 
indicate an interval of about an hour, and to “the frying of a locust” 
to designate a much shorter lapse of time, in the Andaman Islands, 
it is possible to identify a succession of characteristic odors during 
the year, as different plants come to bloom. Odors also play an im- 
portant part in the magic of the Andamanese. It is not surprising, 
therefore, to find that these people have “adopted an original method 
of marking the dilferent periods of the year by means of the odorif- 
erous flowers that arc in bloom at different times. 1 heir calendar is 
a calendar of scents” (46, p. 3I I ). 

In the 'Furkish \iiiage of Kariik (50, pp. 378-370), which until 
recently was relatively untouched by modern technological develop- 
ments, times of the day arc indicated by such expressions as “first 
rooster,” “leaving of oxen” (for grazing), “mid-morning,” and “re- 
turn of oxen,” rather titan in terms of hours. Few of these villagers 
know the names of the days of the week, or calendar dates and 
months. Week days arc distinguished largely as market 9ays in vari- 
ous neighboring towns, where the villagers must go for all trading. 
Thus Sunday is SaiulliU Market, Monday is C/uresar Market, and 
Thursday is Cal Market. Divisions of the months arc based upon the 
appearance of the moon. In dividing the year, such seasons as sum- 
mer, fall, winter, and spring are recognized, and to them are added 
the seasons for “haying,” “end of harvest.” “sowing,” and other farm- 
ing and animal-raising activities. 

Space concepts arc ccjually dependent upon culturally determined 
frames of reference. That individuals’ conceptions of distance and 
geography arc largely colored by their own experiences is at the basis 
of the w aggish “maps” of the Dnited States w^hich have been prepared 
to portray, for example, a Bostonian’s or a New Yorker’s idea of the 
country. Like most caricatures, these “maps” undoubtedly reflect 
some bona hile perceptual differences resulting from the different 
interests, traditions, and knowledge of persons reared in different 
parts of the country. Other examples could easily be cited from every- 
day observation. I hc seasoned air traveler has a ver\ different con- 
ception of distances than docs the farmer who has never ventured 
farther than the village store. In the previously mentioned Turkish 
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village of Karlik (50), distances arc not ordinarily reported in kilo- 
meters or some other standardized system, but arc described in such 
terms as “within a bullet's reach,” or “as far as my voice can go.” 
More remote points arc indicated in terms of the length of time re- 
quired to reach them on foot. Confusions and misconceptions arise 
on the few occasions when such persons travel by train or bus, since 
they have no basis for translating the time spent in transit into their 
familiar frame of reference. A similar use of such “psychological 
units” to express distance has been observed among the Saultcaux 
Indians, who estimate distance in terms of the number of “sleeps,” 
or nights spent on the road (21 ). 

Even the familiar designation of directions in terms of north, 
south, cast, and west, although prevailing over a large part of the 
world, IS not a universal system. Thus among the natives of Dobu, 
space is conceived as a large garden clearing, such as the individual 
encounters in the daily life of his community. “Just as the garden has 
its inland border kaikai. its seaward border kunnkunnvana, and its 
sides jwfui, so also has space in its widest extension” (17, p. 131). 

The individuars memory for events he has observed or facts he 
has been told is likewise colored by his cultural background. This is 
particularly well illustrated by the observations and tests of Bartlett 
(3, 4) and Nadel (40, 41 ). Both investigators, working with South 
African tribes, have shown the important part played by cultural 
patterns in the “rcstructuralization” and distortions of recall. For 
example, when repeating a European story, individuals in these 
groups tended to cast it w :h ciiaracters typical of llicir tribal folklore. 
They likewise rearranged sequences and introduced twists of plot 
characteristic of their native storie 

Bartlett (3) reports a number of interesting observations made 
among the Swazis, a small group of Bantu natives of South Africa. 
In recounting an episode, or in answering a simple question, the 
Sw\'izis charactciistically repeat every detail, however irrelevant, and 
appear incapable of reaching the end ol the story without a rote 
recital of all the intervening steps. An especially vivid illustraticm of 
the effect of socially determined set upon recall is provided by an 
incident cited by Bartlett. A legal matter necessitated a visit to Eng- 
land by a Swazi chief and several of his leading men. When the party 
was questioned upon its return, their most vivid recollection was that 
of the English policeman regulating traffic with uplifted hand. In 
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explanation, Bartlett writes; “The Swazi greets his fellow, or his 
visitor, with uplifted hand. Here was the familiar gesture, warm with 
friendliness in a foreign country, and at tlie same time arresting in 
its consequences. It was one of the few things they saw that fitted 
immediately into their own well-established social framework, and 
so it produced a quick impression and a lasting elTect” (3, p. 248). 

Studies by the “//re association'''^ technique provide furfiicr indi- 
cations of the effects of conventionalized stimuiational backgrounds. 
In one of the most clear-cut of such experiments (16), 218 male 
subjects, including first and second year law students, first and second 
year medical studcnls, and liberal arts students, were tested with a 
forty-word list. The list contained twenty “neutral” and twenty “crit- 
ical” words arranged in random order. The critical stimulus-words 
were chosen so that each had a common legal meaning, a different 
medical meaning, and if possible a third neutral meaning not spe- 
cifically related to either field. The neutral stimulus-words were 
selected from an earlier standardized list (Kent-Rosanoff); they were 
words to which people in general gave relatively uniform responses 
which were neither legal nor medical in nature. Examples of the 
neutral stimulus-words included: rough, girl, long, riverf eagle. Some 
of the critical words were: hereditary, expiration, discharge, com- 
pensate, void, tender. 

A classification of the responses to the critical words showed that 
the first and second year law students gave 8% and 177'^^? more “legal” 
responses, respectively, than the control group; and they made 4% and 
5^% fewer “medical” responses, respectively, than the control subjects. 
The first and second year medical students, on the other hand, made 
25% and 30% more “medical” responses, respectively, than the con- 
trol group; their corresponding frequencies of “legal” responses were 
9% and 1 \ % fewer than those of the control group. Typical “legal’' 
responses to one of the critical slim ulus- words, “administer,” included 
such terms as administrator, estate, government, money, will; among 
the “medical” responses to the same stimulus-word were anaesthetic, 
dose, first aid, inject, syringe. The results of this experiment demon- 
strate the influence of occupational conditioning upon verbal reactions. 
Similar differences would undoubtedly be found with reference to 
broader cultural groupings. 

Cultural influences are also discernible in the motor habits of dif- 
ferent peoples. The gait and tempo of walking, as well as the charac- 
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teristic standing, sitting, and sleeping postures, vary widely from one 
culture to another. The carved ivory and wooden headrests of Africa 
which are preserved in our art museums impress the American ob- 
server as a most uncomfonable sleeping aid! Most primitive peoples 
sit in a squatting posture; Eskimos, as well as many American Indian 
groups, habitually sit on their heels (cf. 6). 

The role of cultural factors in the development of gestures has 
already been discussed in an earlier chapter (Ch. 22). A typical illus- 
tration of the cultural conditioning of a response often assumed to be 
“natural” and universal is to be found in gestures of negation and 
affirmation. Nodding to signify assent is by no means shared by all 
peoples, nor is the lateral turning of the head a universal sign of nega- 
tion. The Semang, a pygmy tribe of interior Malaya, say “yes” by 
thrusting the head sharply forward, and “no” by lowering the eyes 
(30). The Dyaks of Borneo raise their eyebrows for “yes,” and con- 
tract them slightly for “no.” For the Maori, raising the head and chin 
signifies “yes”; among the Sicilians, the same gesture means “no” 
(26, p. 282). The use of the fingers in pointing is likewise restricted 
to certain cultures. Among several American Indian groups, for ex- 
ample, pointing is executed with the lips (30). Changes in gesture 
patterns occurring over a relatively short period within our own cul- 
ture can be readily noted by looking at early movies. The seemingly 
“unnatural,” stilled, and exaggerated nature of the actors' gestures is 
immediately evident to the modern observer. 

Closely related to such observations of gesture is the comparative 
study of emotional expre ion in difTerenl cultures. A rich body of 
data is available in this area, indicating dilTercnces in the extent of 
emotional display, the occasions on vhich emotional behaeior is mani- 
fested, the specific patterning of emotional responses, and the degree 
of control which the individual is able to exert over such behavior 
(cf. 27, Ch. Vll; 30). The manner of greeting employed by different 
peoples would in itself constitute a fertile field for such inter-cultural 
comparisons. The practice of kissing, as a friendly greeting or as a 
sexual response, varies widely in diFcrcnt cultures and is totally absent 
in a number of primitive societies. It is interesting to note in this con- 
nection that Kinsey and his associates (28) found similar diflerences 
between social classes within our own culture. Among persons of 
lower socio-economic level, kissing as a sexual response was relatively 
infrequent and was even regarded by many as unhygienic. The latter 
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attitude was particularly interesting in view of the fact that it was fre- 
quently expressed by persons who habitually used common drinking 
cups and followed other practices considered insanitary in higher 
socio-economic levels. 

Many instances have been recorded of the ceremonial control of 
certain emotional reactions, such as weeping, to a degree which ap- 
pears surprising to persons reared in our culture. Ritual shedding of 
tears on a variety of prescribed occasions has been observed in such 
countries as China and Montenegro, in a number of American Indian 
tribes, and among the Maori, Andaman Islanders, and other primitive 
cultures (27, Ch. VII). 

An especially clear illustration of the effect of the cultural milieu 
upon behavior is furnished by acsiiielic preferences and artistic “taste.” 
The evolution of styles in music, painting, sculpture, architecture, and 
the other arts testifies to the shifting demands of “taste.” diie styles 
which are derided as harsh, barbaric, and uncouth by one generation 
have often been accepted as masterpieces by the next. Any artistic 
innovation which clashes too vigorously wilh the familiar and the tra- 
ditional forms of artistic expression requires a period of gradual 
habituation. It is an unfortunate but perhaps psychold^jcally indis- 
pensable fact that the great art leaders who are subsequently hailed 
as the initiators of new movements <4tcn suffer ridicule and derision 
during their lifetime. This follows necessarily from the fact that they 
come at a time when the adequate experiential background for the 
enjoyment of their products is lacking. 

The question of the sophisticated and the naive observer is also 
relevant to this point, fhe trained critic or the sophisticated observer 
has had certain specihe experiences which enable him to enjoy artistic 
products that may appear meaningless, indifferent, or even unpleasant 
to others. Psychologically, there is no “naive observer”; such an indi- 
vidual is naive only from the standpoint of a specific class of experi- 
ences. His judgments are, however, directly intluenced by other 
experiences which he has had. His artistic reactions will be largely 
dictated by common everyday observations and popular fashions. 
'Phus the observer may enjoy realistic art because he is more familiar 
with photographic reproductions of objects; or he may reflect some 
traditional artistic conception which has been inculcat'^d in him from 
early childhood. But in no case is his judgment made independently 
of experience. The essential difference between the sophisticated and 
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the naive observer is in the kind of past experience which they 
have had. 

It is a familiar observation that Occidentals who hear Chinese 
music for the first time find it not only discordant and harsh, but also 
unpleasantly loud. At the same time, it has been reported that the 
Chinese find American jazz and Wagnerian brasses disturbingly loud 
upon their first exposure to such music (cf. 27, p. 209). Similar data 
arc provided by the hislory of Western music, which clearly reveals a 
progressive shift in the point of demarcation between consonance and 
dissonance (cf. 39 ). Intervals which were considered dissonant at one 
period were accepted as consonant in the next, I he transition occurred 
from those intervals in which fusion of the notes is easily obtained 
to those in which fusion is more dirhcult. As the new^cr intervals came 
into use, the intervals which fused more readily declined in popularity. 
The preferred internals at any one period seem to have been those 
which were '‘just consonant,” i.e., those in which fusion was neither 
too easy nor too dillicull. The former were regarded as relatively unin- 
teresting, the latter as dissonant. 

As an experimental check upon such a “genetic" theory of conso- 
nance, Moore (39) ana!\/ed the repealed judgments of nine subjects 
on four musical intervals. Two of these intervals w^crc considered 
dissonances (major and minor 7th) and twx> consonances (3rd and 
5th) at the beginning of the experiment. The subjects underwent a 
period of habituation in which a)! four intervals were repeatedly ex 
pcrienced in musical passages. The judgments obtained at the end of 
this period showed certain aiiinistakable changes in the relative pref- 
erence for each interval. Of the two initially consonant intervals, the 
3rd lost rapidly in aesthetic value, vvhile the 5lh maintained a fairly 
constant level. The dissonances, on the other hand, showed a gain in 
preference, the minor 7lh gaining more rapidly than the major 7th. 
According to Moore’s tiieory, the region of highest aesthetic value for 
an interval is the “barely consonant” region. The greatest changes with 
repetition were therefore to be expected in die intervals nearest this 
region, viz., the 3rd on the one han i and the minor 7th on the other. 
This experiment furnishes a vivid demonstration of the dependence of 
artistic “taste” and aesthetic judgments upon experiential factors. 

Another relevant experiment is that conducted by Foley (13, 15) 
on occupational differences in preferential auditory tempo. The sub- 
jects were 684 girls between the ages of 13 and 20, all of w^hom were 
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enrolled in a trade school in New York City. Comparisons were made 
between groups in the following courses: power machine sewing 
(N = 90); hand sewing (N = 180); beauty culture (N = 165); 
typewriting (N = 182); and courses in domestic occupations, includ- 
ing waitress training, home nursing, nursery education, and others 
(N ~ 67). These vocational groups were roughly comparable in age, 
intellectual status, education, socio-economic level, and natio-racial 
background. The principal difference between them was in their spe- 
cialized vocational training. In the experiment, each subject listened 
individually to a scries of auditory tempos produced with a standard 
metronome. Six representative speeds were used, ranging from 56 to 
200 beats per minute and corresponding to the musical designations 
of largo, larghctto, adagio, andante, allegro, and presto. The six speeds 
were presented serially, in an ascending and then a descending order, 
the subject reporting whether she liked or disliked each. The proce- 
dure was repeated, when necessary, until the preference was narrowed 
down to a single speed in both the ascending and descending series. 


TABLE 69 Auditory Tempos Preferred by Different Vocational Groups 

( Vd.jplcd from l''ulcy, 15, j>. J25) 



Vocciiioiiul Gioup 


A Iran 

7 iniipo 

Appioximate 

Musiml 

Drsi^natiou 


'Typcwnling 

182 

178.08 

Allegro-Presto 


Tower miiehinc sev^iug 

90 

161.02 

Allegro 


Beauty cullure 

165 

n9.04 

Andante 


Hand sewing 

180 

134.46 

Andante 


T)omcslie occupations 

67 

133.61 

Andante 


The mean tempos preferred by each vocational group arc shown in 
Table 69. It is apparent that wide occupational differences were found, 
which icllcctcd the nature of the auditory stimulation to which the 
subjects had been exposed during their vocational training. Thus the 
beauty culture, hand sewing, and domestic groups chose the slower 
rales, corresponding closely to an andante tempo. The power machine 
sewing group preferred a relatively slow allegro, while the typists 
chose a fast allegro bordering on presto. Although individual differ- 
ences within each group were large, the mean differences showed a 
high degree of statistical significance. Moreover, the group differences 
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in preferential tempo were more marked between advanced groups 
than between groups which were at an earlier stage of vocational train- 
ing. The findings of this study are in close conformity to what would 
be expected on the basis of occupational conditioning. The machine 
sewers and typists, who preferred the more rapid tempos, had been 
exposed to loud, rapid, repetitive noises from the typewTiters and 
power sewing machines. On the other hand, rapid auditory stimulaUon 
does not accompany the activities performed by the hand sewnng, beauty 
culture, and domestic groups, which preferred the slower tempos. The 
habitual occupational activities thus seem to have shifted the sub- 
jects’ frames of reference, in terms of which their preferential judg- 
ments were made." 

We need not go beyond everyday observations in our own culture 
to find further evidence of shifting frames of reference in preferential 
responses. A vivid illustration is provided by the response to chang- 
ing fashions in women's wear. A style which appears beautiful to most 
observers when it is at the height of fashion will probably look dull 
and unattractive within a season, and positively ludicrous if viewed ten 
years later. These rapid changes in ‘'taste” come as no surprise to 
fashion leaders, since the fashion industry deliberately provides the 
stimulation which biings about the change in response. Upon the 
introduction of a new style, the buying public is exposed to a carefully 
planned and coordinated “blitzkrieg,” designed to prepare them for 
the acceptance of such a style. The new fashion is pictured in maga- 
zines and newspapers; models wear it on the street and in theatres^ 
restaurants, and other pu’ lie places; window displays feature it con- 
spicuously. Through these and similar techniques, the public is rap- 
idly “sensitized” to the new style in much the same manner that 
Moore’s subjects were habituated to the unfamiliar, discordant com- 
binations of notes. If the fashion is too ‘‘discordant” and clashes too 
violently with the previous experience of the public, the sensitizing 
process may fail and the fashion will be rejected. Similarly, tlie suc- 
cessful fashion leader keeps well posted on current developments in 
other areas — social, economic, polhical, artistic — in order to coordi- 
nate his innovations with the more general frame of reference of his 
consumer public. 

^The reader is referred to the original study (13) for a summary of evidence 
indicating that these group differences are not likely to have resulted from selective 
factors. 



848 Differential Psychology 

Even in the realm of science, where “objective truth” presumably 
reigns supreme, the effect of the individuars frame of reference can- 
not be wholly eliminated. The data of science admit of various inter- 
pretations. One or another of such interpretations may seem to follow 
inevitably from the given facts, depending upon the observer's experi- 
ential background. This is exemplified by the various approaches of 
different sciences to the same phenomenon, as well as by the presence 
of distinct ‘\schools” within a single science. There are “fashions” in 
science as in other areas. The general cultural milieu of the period is 
rellected in the nature of its scientific products and theories, just as it 
is in other phases of human activity. Jl is not a coincidence that cer- 
tain basic similarities can be found in such diverse phenomena as the 
science, art, social structure, and economic policies of any given 
period. I’he setting for all such developments is the common experien- 
tial background of the people of that age. 

“Drvi:LOPMr:NTAL siAcrs” and the ctiltural setting 

Theories of dcvclopmcnlal stages furnish numerous illustrations of 
the tendency to overgcneralizc from observations wifhin a single 
group. Child psychology is replete with such theories. Much interesting 
material has been gathered, for example, on the formation of concepts 
in childhood. The child's ideas about the physical wculd, his “con- 
sciousness of self,” his interpretation of dreams, and similar concep- 
tions have been analyzed into definite developmental sequences. Out- 
standing in this field arc llie theories of the Swiss psychologist Piaget 
(43, 4^ 45). 

in an extensive scries of investigations, Piaget arrived at the conclu- 
sion that the thinking of the child is animistic and that the transition 
from this initial animism to the adult's conception of the world is made 
through four major stages. For children between the ages of 4 and 6, 
everything active is alive. wSince children of this age arc also anthro- 
pocentric, “activity” is regarded as synonymous with usefulness to 
man. Thus the sun is active because it gives warmth, stones are active 
because you can throw them. At this first stage, therefore, all objects 
which are unbroken and in good condition are considered to be alive 
and “conscious.” In the next stage (6-7 years), only movable objects 
are believed to be alive. Jn the third stage (8-10 years), life is attrib- 
uted only to things which can move spontaneously. Thus the sun and 
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a river arc alive, but an automobile is not. In the final stage ( 1 1 years 
on), life is restricted to animals and plants, or sometimes to animals 
only. 

Such stages have been commonly accepted as an inevitable or nat- 
ural development through which the child must pass. There are, how- 
ever, numerous factors within the experience of a child in our society 
which might account lor such animistic tendencies. 7 he language 
which the child is tauglit encourages liim to form an animistic con- 
ception of the world. Thus he hears <hc sun referred to as “he," and 
the moon or a ship as ‘‘she." Figurative expressions, such as the 
“rising" and “setting" of celestial bodies, the “running" brook, and the 
“howling" wind, are not conducive to an impersonal conception of 
natural phenomena. If to this are added the fancies of poetry, fairy 
talcs, and other imaginative literature, it is apparent that the child’s 
experience has a strongly animistic flavor. It is not until he has had 
the opportunity to accumulate a certain amount of information from 
direct observation of cause and effect in everyday situations, that such 
a child can arrive at a realistic notion of the world. 

Data supporting such an experiential interpretation of the develop- 
ment of children’s concepts are to be found in studies on children in 
different cultures. Mead’s observations on the island of Manus in New 
Guinea led her to conclude that animism is absent in the thinking of 
Manus children (37). In both the spontaneous remarks of these chil- 
dren and in their replies to questions, she found evidence of a very 
realistic conception of natural objects and events. The drifting away 
of a canoe, for example, is rot attributed to malicious intent on the 
part of the canoe or to other supernatural factors, but to the fact that 
it was not securely fastened. Such an answer was obtained despite the 
fact that in her conversation with the child the investigator had 
attempted to place the blame on the canoe. Mead attributed this real- 
istic attitude to the type of training which such children receive. From 
early childhood they are forced to make a correct adjustment to the 
physical demands of their environment. The responsibility for a mis- 
hap is never shifted to an inanimate object, as in blaming the log if 
the child trips over it. If the child huUs himself, he is told that it is the 
result of his own clumsiness. It is interesting to note that, in certain 
respects, the adults are more animistic than the children in this cul- 
ture, since they explain sickness, death, and other misfortunes as the 
activity of “spirits." 
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Dennis (9) has argued that Mead's results do not disprove the 
applicability of Piaget’s developmental theory to the thinking of Manus 
children, since her methods of investigation were not comparable to 
those of Piaget. Mead's findings, according to Dennis, demonstrate 
only the absence of tendencies to personify and humanize inanimate 
objects, rather than a lack of animism among Manus children. What- 
ever tlie interpretation, however, these observations do suggest that 
the characteristics of child thought may vary from one culture to 
another. 

In a subsequent investigation by Dennis (9), designed as a more 
direct check of Piaget’s theories, 98 Hopi children were studied 
through standardized individual interviews and a group questionnaire. 
The survey dealt with ( I ) animism in the more restricted sense, i.c., 
being alive; (2) the attribution of “consciousness” to things; and (3) 
“moral realism,” as in the explanation that “the bridge felt because 
the boys crossing it had stolen apples.” In all three respects, the Hopi 
children were far more animistic and less realistic in their replies than 
white children of the same ages tested in other investigations. Dennis 
dismisses the possible explanation that differences in “intelligence” 
might account for the greater animism of the Hopi children, since their 
performance on the Goodenough Draw-a-Man 1 cst equaled or ex- 
celled the white norms. In the light of the known characteristics of the 
Hopi culture, Dennis concludes: “The explanation must, therefore, be 
sought in terms of environment, and no doubt in the cultural environ- 
ment rather than in terms of the physical environment. The differences 
in social environment between the Hopi child and the white American 
child are numerous” (9, p. 32). He further points out that the con- 
cepts of the Hopi children are similar to those reported by Piaget and 
others for white children, but that such animistic concepts are retained 
until a later age among the Hopi. 

Emotional development and personality adjustments have also been 
analyzed from the point of view of “stages.” The most widely discussed 
of such stages is probably the period of “storm and stress” character- 
istic of the adolescent. Almost all writers on child psychology ascribe 
emotional upheavals, personality changes, conflicts, and maladjust- 
ments to this age. There is evidence, however, to show that this is not 
a universal phenomenon. In certain societies (cf., e.g . 34, 35, 38), 
the adolescent assumes his altered status, both physical and social, 
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without emotional disturbance. His tasks arc cut out for him by tradi- 
tion; there arc no momentous choices and decisions to be made; no 
mystery attaches to his position; and no trace of embarrassment is 
encountered> 

There is much in our society, on the other hand, which fosters 
adolescent maladjustments, riuis the individual is placed in an ambig- 
uous and ill-defined position, being treated neither as a cliild nor as 
an adult. Restrictions upon his actions are frequently increased, while 
at the same time he is expected to be more self-reliant than he had 
formerly been. Embarrassment and a general atmosphere of mystery 
arc often directly induced by adults through their attitudes, remarks, 
and actions. In view of the many experiential factors in our society 
which might lead to adolescent maladjustments, there seems to be no 
need to posit an innate or physiological basis to the storm and stress 
of this period, nor to regard such emotional upheaval as a necessary 
developmental stage. 

Another aspect of child behavior to w'hich the concept of develop- 
mental stages has been widely applied is drawing. Children’s drawings 
have been collected in large numbers and submitted to detailed analy- 
ses, in the hope that they might furnish a clue to the child’s mentality. 
The best-known example of such a use of children’s drawings is pro- 
vided by the Goodenough Draw-a-Man Test, with its carefully stand- 
ardized scoring and extensive age norms. The voluminous literature on 
children's drawings reveals a widespread belief among psychologists in 
the existence of definite developmental sequences in drawing behavior. 
1 hese stages have often been regarded as products of maturational 
factors and assumed to be independent of specific environmental stim- 
ulation. The drawings characteristic of each age level are believed to 
be distinguishable in subject-matler as well as in many aspects of 
technique and execution. 

Such generalizations in regard to the drawing behavior of children 
are, however, limited to certain specific groups with a common cul- 
tural background. Spontaneous drawings by children of different 
national and cultural groups have been gathered and described by 
several investigators. ^ These data bring out very clearly the part played 
by the child’s environment in determining every phase of his drawing 
behavior. Thus the type of object most frequently drawn at each age 

^ For a survey of much of this literature, cf. 1, 19. 
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s,hows a wide variation from one group to another. In the studies on 
American children (cf., e.g., 1, 19), drawings of the human figure 
predominate at the younger age levels. That this is not a universal 
tendency among young children has, however, been repeatedly dem- 
onstrated. In a study on Swiss children, for example, the human figure 
occupied a relatively insignificant position, misccllancoLis objects and 
houses heading the list (25). Representations of people are likewise 
infrequent or almost completely absent in the drawings by children 
from several other countries (cf. 1 ). In general, the subject-matter of 
children’s drawings varies so widely from group to group as to make 
any attempted universal classification quite meaningless. 

Similar differences are apparent in all other aspects of the draw- 
ings (cf. 1). Whether the child draws broad panoramic views or 
scenes at close range, isolated objects or organized pictures, imagina- 
tive themes or realistic portrayals seems to depend in large measure 
upon his specific environmental milieu. In certain groups, the drawings 
are full of action, in others stationary objects and figures predominate. 
The organization of the picture likewise differs from one group to 
another. In some groups, a single unified scene is most often presented, 
in others a sequence of events, in still others isolated^objects. The 
degree to which color is employed, as well as the choice of specific 
hues, usually reflects the influence of both physical environment and 
social traditions. 

The representation of detail brings out some interesting facts. In 
certain groups, detail is relatively poor, total impressions and broad 
vistas being emphasized. In others, the minutest details arc painstak- 
ingly drawn into the picture. An even more significant point, how- 
ever, is the specificity of the details which are represented. Thus 
among a group of children belonging to a hunting tribe in Siberia, 
remarkably accurate and naturalistic drawings of reindeer and elk 
were obtained (48). These drawings were clearly superior to those of 
the human form or of any other subject executed by the same children. 
It should be noted that none of these children had had previous expe- 
rience in drawing. The investigator points out that the sharpened 
visual perception, manual dexterity, and keen observation fostered in 
such a hunting tribe probably influenced the accuracy of the drawings, 
especially when the objects represented were the animals commonly 
hunted by the tribe. 
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This tendency to elaborate those details which are specifically 
observed and which play an important part in the individual’s every- 
day activities, while ignoring other details, is very commonly found in 
children's drawings. Examples can easily be multiplied (cf. 1). Thus 
in one survey (1), the drawings by European children usually por- 
trayed vegetation only in a general way, with no attempt to show spe- 
cific type or variety. 1 he children from many tropical and semi- 
tropical countries, on the other hand, often featured fruit trees and 
dense forests as a major part of the drawing, and included suflicient 
detail to indicate the particular type of plant pictured. One group of 
drawings by Hungarian and Czecho-Slovakian children, although 
crude in other respects, showed minute and carefully executed details 
in the national folk costumes. In drawings by American Indian and 
Balinese children, elaborate details occurred in the ceremonial 
masks and head-dress, in contrast to the paucity of detail in other 
objects. 

Stylized representations and special cultural attributes are also dis- 
cernible in the drawings by children in certain cultures, becoming in- 
creasingly apparent with age (cf. 1, 38). In a study of Hopi children 
with the Goodenough Draw-a-Man Test, the younger children tended 
to draw generalized human figures, while approximately one-third of 
the 10-ycar-olds drew figures in which special characteristics of the 
Indian culture could be recognized (8). A similar tendency has been 
noted among Indian children of the Northwest Coast of Canada. In 
one such survey (2), the instructions were simply to ‘'draw' an ani- 
mal.” The subjects consisted of 159 Indian children between the ages 
of 5 and 18, all attending an Indian school at Alert Bay. About half 
of the group belonged to the Kwii'dutl, the remainder being divided 
among five other tribes found in British Columbia. Many of the 
draw'ings reflected the habitual activities and interests characteristic of 
life in an Indian community. Twenty were clearly recognizable as 
stylized representations executed in the traditional manner of the par- 
ticular Indian culture. Both the subject matter and technique of these 
drawings showed the influence of the subjects' institutionalized be- 
havior. Among the animals portrayed were the killer whale, sea lion 
thunder bird, and mythical double-headed serpent. That a mythical 
creature should be drawn at all in response to the directions to "dT'aw 
an animal” is itself an indication of the strength of the cultural influ- 
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ences. The highly conventionalized technique of these drawings can 
be seen in Figure 102, which shows samples of the killer whale and 
sea lion. It should also be noted that such stylized drawings were more 



Fig. 102. Drawings by Indian Children of the Northwest Coast, in 
Response to Directions to Draw an Animal. (From Anastasi and Foley, 
2, p. 369.) 


common among boys, having been produced by 17% of the boys and 
only 7% of the girls. This is in keeping with the ftict that painting and 
carving are conducted exclusively by the men in these tribes. In the 
light of all these findings, it v/ould seem hazardous to regard the 
richness of detail, general technique, or any other feature of children’s 
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drawings as an index of developmental stage, unless such features are 
considered in reference to the child’s cultural background. 

LANGUAGE AS A CULTURAI. FACTOR IN BEHAVIOR 

The data of comparative linguistics and anthropology — and more 
recently the writings of the semanticists — have suggested the impor- 
tant part which the nature of a people's language may play in their 
conceptions of the world about them, their attitudes, and other be- 
havior characteristics (cf,, e.g., 7, 22, 29, 31, 33, 42, 4*7, 54). In a 
very fundamental sense, language provides the tools for much of our 
thinking. The relationship between language and thought has been 
vividly expressed by Whorf (54, p. 231 ), who points out that each 
particular language '’is not merely a reproducing instrument for voic- 
ing ideas but rather is itself the shaper of ideas, the pn^gram and guide 
for the individual’s mental activity, for his analysis of impressions, for 
his synthesis of his mental stock in trade.” As he further states, “We 
dissect nature along lines laid down by our native languages.” In a 
similar vein, Mauthner (33, p. 4) wrote, “If Aristotle had spoken 
Chinese or Dacotan, he would have had to adopt an entirely dilTercnt 
logic, or at any rate an entirely different theory of categories.” 

Language influences the type of distinctions and discriminations 
which we make in observing our surroundings. Objects and events in 
nature do not, of course, occur in the distinct categories to which wc 
have become accustomed. Such categories have generally been de- 
veloped to fit specific purposes and to facilitate our dealings with 
objects. Once objects arc put into a specific category, or “named,” 
however, our attention is thereby focused upon their similarities or 
common characteristics, and we tend to ignore differences among 
members of the class. Thus what we notice and what we overlook in 
our environment depend in part upon our particular linguistic system. 
When the conditions existing within a given culture have made certain 
distinctions important, w^e are likely to find separate words corre- 
sponding to such differentiations. 

This is illustrated in Figure 103, in which certain words in the 
Eskimo and Hopi languages arc compared with their English equiva- 
lents. Thus to correspond to our one word, “snow,” Eskimos have 
several words, indicating “falling snow,” “slushy snow,” “hard- packed 
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snow,” and so on. On the other hand, the Hopi use a single word to 
designate “anything which flies, exclusive of birds.” An insect, an 
airplane, and even a pilot would be called by this single name, the con- 
text determining which was meant. For our one word, “water,” how- 



Hopi-Pahe \jlMaiBS5aSiMi!7 Hopi-Keyi 


Fig. 103, The Classihcatioii of Objects in Different Languages. (From 
WhorL 54, p. 230.) 

ever, the Hopi have two words, one referring to “flowing water’’ and 
the other to “water in one place, held within a container.” l^ixamplcs 
could easily be multiplied. In Arabic, the number of different words 
relating to “camel” is said to be about six thousand (51). There are 
terms to refer to riding camels, milk camels, and slaughter camels; 
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otner terras to indicate the pedigree and geographical origin of the 
camel; and still others to differentiate camels in different stages of 
pregnancy and to specify innumerable other characteristics important 
to a people so dependent upon camels in their daily life. 

A particularly interesting example of the role of language in the 
classification of observed phenomena is provided by color tcrmivolo^y. 
The way in which hues arc grouped varies in different languages, «md 
probably in turn affects the type ol color discriminations which arc 
customarily made in each particular culture. In certain modern Euro- 
pean languages, there are different words foi “light blue” and “dark 
blue,” just as in English we have the terms “pink" and “red." The 
Ashantis of the African Gold Coa^t have color names for black, red, 
and white: the term “black” is used (or any dark color, such as blue, 
purple, or brown; while “red” also covers pink, orange, and yellow 
(52). In the same group, gray is expressed by the word for “wood 
ashes,” and green by the term for “tree" or “leaf” (52). Among the 
Manus of New Guinea, yellow, olive-grecn. bhic-gray, gray, and lav- 
ender are regarded as variations of one color (.3b). The terms for 
“Hue” and “green” arc often combined in primitive languages. We 
cannot, of course, conclude from these linguistic classifications that 
the color sensitivity ot such peoples is inferior to or diflcicnt from 
ours. Objective tests of color-blindness have demonstrated a normal 
ability for color discrimination, despite the lack of differentiating 
terminology. It is apparently the specific conditions of the particular 
culture which determined the type of classification developed in 
each case. 

Not only vocabulary, but also the formal aspects of language, show 
characteristic differences trcmi on ‘ culture to another. 1 hus in the 
Hopi language there arc no lempc ral references in verbs, but special 
forms are employed to indicate the nature of the statement, such as 
immediate obscivation, memory or geneialization (d. 54). Similarly, 
among the Hupa Indians ol California, a suffix is used to designate 
the source of information, such as hearing sight, or conjecture from 
circumstantial evidence (18). On distinction between nouns and 
verbs is not so fundamental or universal as might be supposed lo 
take another illustration from the Hopi language, such terms as 
“lightning,” “wave,” “meteor,” “puff of smoke " and “pulsation” are 
verbs, as are all events of necessarily brief duration (cf 54) In some 
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languages, the distinction between verbs and nouns is non-existent, all 
terms corresponding most nearly to our verbs. I'hus “it burns” would 
signify a flame, and “a house occurs” or “it houses” would refer to 
our noun “house” (cf. 54). 

In an analysis of the extensive data collected by Malinowski on the 
Trobriand Islanders, Lee (31) proposed a provocative theory regard- 
ing the language of these people and its role in their other behavior. 
The Trobriand language, according to Lee, shows a focusing of atten- 
tion upon disparate elements or acts, considered independently, rather 
than upon relationships among events. Their sentenees are composed 
of essentially unrelated words. The comparative and superlative de- 
gree are absent, as arc pure adjectival concepts; adjectives in this 
language refer to specific classes of objects and cannot be abstracted 
from them. Caiisc-and-eflcct relationships are not conventionally 
expressed. When questioned regarding such causal relations, the indi- 
vidual Trobriander does not have ready-made answers provided by his 
culture. Each individual must think out his own answer, the replies 
showing confusion and disagreements. Chronological sequences are 
likewise unimportant to them. “The past is not an ordered series, but 
rather a chaotic repository of unrelated events, which, •at best, are 
remembered as anecdotes” (31, p. 360). 

A frequently reported observation is the relative scarcity of abstract 
terms in most primitive languages. Such a condition, of course, makes 
abstract thought much more difficult. It need not, however, imply an 
inability to carry on abstract thinking, any more than color-blindness 
is implied by the color terminologies discussed above. The presence 
or absence of abstract terms in a particular language may simply re- 
flect the conditions of life within that culture. I'hcre is some evidence 
suggesting that terms of a higher level of abstraction can often be 
developed by such peoples when a situation is presented which requires 
such terms (cf. 27, p. 46). 

Finally, it should be borne in mind that language is, essentially, 
behavior (cf. 24). It is not an independent entity, as philologists are 
sometimes inclined to regard it. At the same time, language serves as 
a potent cultural influence. The particular system of linguistic terms 
and forms institutionalized by a given culture represents an important 
part of the total complex of stimulation to which each individual is 
exposed. Regardless of how such linguistic behavior originally evolved 
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in the group, it assumes a major role in shaping the psychological 
development of the individual. 

“human nature” in different CUl.TURES 

Certain ways of acting have long been popularly regarded as “nat- 
ural.” This designation usually implies that the behavior in question is 
“normal” as well as innate and biologically predetermined. Closely 
related to this concept are those of ‘ perversion” and “reversion.” The 
former refers to behavior which is considered “unnatural”; the latter 
implies a revival ui reinstatement of a more “primitive” and less 
“artificial” type of behavior. Thus if one type of behavior is assumed 
to be natural, then any environmentally produced variation of such 
behavior is considered a perversion. Similarly, if a “civilized’’ person 
be pul in a “primitive” environment, the behavioral changes which 
may ensue are regarded as a reversion to a natural state. The latter is 
implicitly assumed to have existed all along, but to have been held in 
abeyance, so to speak, by conditions in a civilized community. It is 
apparent that the concepts of perversion and reversion have meaning 
only as long as one specific way of behaving is assumed to be the 
“natural” way. 

It has been repeatedly demonstrated, however, that no one form of 
behavior is any more natural than another in the sense of being pre- 
determined by innate constitution. The data on tliis question are 
derived chiefly from two sources. The iirst is the experimental pro- 
duction of behavioral v;.iiations. A number of such experiments on 
infrahuman organisms have been reported in Chapter 6. The import 
of their results was to show that CifTerent types of behavior will follow 
as a natural result of varying e.ivironmcntal conditions. Much so- 
called instinctive behavior has been shown to be natural only under 
given environmental conditions. 

The same point has been demonstrated by inter-cultural compari- 
sons. Many forms of behavior which have been labeled “instincts” and 
“fundamental drives” are found kt differ significantly from one cul- 
tural group to another.'"* T hus the rule of cultural factors in the expres- 
sion of the maternal drive is illustrated by the widespread custom of 

For many illustrations of tins point, cf. Klineberg (27) Ch. V and VI, and 
Shfrif (4*-)). 
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adopting children, which is practiced among several Melanesian, 
South African, and American Indian groups. In certain tribes, chil- 
dren are so infrequently reared by their own parents that it is very 
^difficult to obtain genealogies. Similarly, in ancient China the social 
concept of maternity was distinct from biological maternity. Thus all 
offspring of "‘secondary wives” within the family unit were considered 
to be children of the “first wife.” The latter was the only person in the 
role of mother, the other wives being indiscriminately regarded as 
“aunts” by their own as well as other children in the family (cf. 27). 

Aggressiveness and fighting, popularly considered to be among 
primitive man’s natural impulses, are unknown among several groups. 
In a few tribes, for example, no weapons or implements of warfare 
arc to be found. That men should attack each other seems inconceiv- 
able to individuals reared in such cultural grou})s. Simikirly, acquisi- 
tiveness and the desire for pcrst)nal property are not a universal phe- 
nomenon.^* A striking demonstration of this fact is provided by the 
social institution of the potlatch, as found among the Indians of the 
Canadian Northwest Coast. In this culture, social prestige is achieved 
through the distribution or giving-away of personal prt)pcrty, rather 
than through its acquisition. 

The manifestations of the drive, with its attendant feelings such 
as love and jealousy, likewise exhibit wide inter- cultural variations. 
The diverse customs and conventions associated with mating behavior 
in different groups have been extensively described by anthropologists 
and many arc undoubtedly familiar to the reader. It will be recalled 
that similar differences in the t>pical manifestations of sex behavior 
were found by Kinsey et al. (28) in their comparisons of diherent 
socio-economic classes within our own culture. Mention may also be 
made of tlic sets of rules and restrictions imposed by dillerenl societies 
upon many forms of behavior, including aggressiveness and physical 
violence, reaction to personal pro])erty. sck activities, and others, flic 
wide variations in such restrictions show them to have little or no 
basis in “human nature” as such. The tuores of one society often 
appear as a quaint set of taboos to another. 

The traditional sex differences in abilities and in personality traits 
arc anotlicr case in point. It was long considered “natural” for the 
sexes to differ in general intelligence and c-pecially in aptitude for 
scientific pursuits and similar branches of learning. Men, too, were 

e.g., Beaglehole (5) for a coinprchcnsi\e ticatmciit ol this question. 
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regarded as naturally more stoical, less given to emotional displays, 
more competitive, less sympathetic. If a given individual displayed the 
intellectual or personality traits of the opposite sex, this was consid- 
ered “unnatural.” An understanding of the experiential basis of these 
behavioral characteristics shows the artificiality of such a distinction 
between natural and unnatural behavior. 

NATURE AND V^RIITY OE PSYClIOl OGICAL GROUPS 

Psychologically the individual belongs to every group with which he 
shares behavior.'" Fiom this point of view, group membership is to be 
defined in terms of behavioral rather than biological categories. The 
effective grouping is not based upon the individuars race or sex or 
body build, but upon his experiential background. 7hus if the indi- 
vidual is reared as i member of a certain national group with its own 
traditions and cultural background and its own peculiar complex of 
stimulating conditions, he will display the behavioral characteristics 
of that group regardless ol his racial origin. It should be understood, 
of course, that meie physical presence does not constitute group mem- 
bership in a psychologicMl sense. 1 hus if a Negro child were brought 
up in a community composed exclusively of whites, he would not nec- 
essarily receive the same social stimulation as a white child. Similarly, 
a boy who is brought up exclusively by female relatives will not 
develop the personality traits of a girl. A psychological group is based 
solely upon shared behavior and not upon geographical proximity or 
biological resemblance. 

It follows from such a concept of group that any one individual is 
effectively a member of a large .md varied set of groups. A multi- 
plicity of behavioral proups, large and small, cut across each other in 
the individual’s background. Some of the most important of these 
groups have already been discussed in Part III of the present book. 
The individual is born into a broad cultural division such as, for exam- 
ple, “Western civilization,” with its characteristic sources of stimula- 
tion. He will develop certain aptitudes, emotional trails, attitudes, and 
beliefs as a result of his affiliation with this group. He is also a mem- 
ber of a given national group with its more specific traditional ways 
of acting. 

This criterion of a psychological group is essentially that lormulated by Kantor 
( 23 ), who seems to ha\c been the first to discuss social behavior in teiriis of shared 
responses to objects having common stimulus functions. 
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If the individual displays certain physical characteristics, such as a 
particular skin color, facial conformation, and body build, he may be 
classified as a member of a given “racial” group which occupies a 
distinct position within the broader national division. In so far as his 
racial background leads to certain social distinctions and culturally 
imposed differentiations of behavior, it will operate as an effective 
grouping. The same may be said of sex. If, within a given society, 
traditional beliefs in regard to sex differences exist so that the sexes 
are exposed to dissimilar psychological stimulation, then the individ- 
ual’s sex will in part determine his behavioral characteristics. 

There are a number of other behavioral groupings which, although 
less frequently recognized and less clearly defined, may be equally 
influential in the individual’s development. Thus it will be recalled 
that important psychological differences are usually found between 
the city-bred and country-bred child, as well as between different 
social-status classes (cf. Ch. 23 ) . Similarly, the particular state, prov- 
ince, or other major division of a nation in which the individual is 
reared, and even the specific town and neighborhood in which he 
lives, will exert significant influences upon his intellectual and emo- 
tional development. 

Other groups with which an individual identifies himself behavior- 
ally are his occupational class, his religious sect, his political party, his 
club, his educational institution. That such groupings represent clear- 
cut cultural distinctions is readily illustrated by the stereotypes which 
have become attached to many of these groups. To people within our 
society, a distinct picture will be suggested by the mention of such 
designations as country doctor, business man, Roman Catholic, Or- 
thodox Jew, Republican, Rotarian, Harvard man. These groups 
influence the individual’s behavior in two ways. First, they directly 
stimulate and foster certain ways of acting. Secondly, the reactions of 
other people to the individual are influenced by their knowledge of his 
group affiliation. The social attitudes and “social expectancy” which 
the individual encounters will in turn affect his behavior. 

Family groupings, with their characteristic activities and traditions, 
constitute another important part of the individual’s psychological 
environment. The famous Herreshoff family of boat-designers and 
builders, the degenerate Kallikaks, eminent families such as the Hux- 
leys and the Darwins, and many other striking examples testify to 
the cultural influence of family membership. Cutting across such 
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family groupings are age distinctions. “Stages” are socially imposed 
upon the continuous life activities of the individual and he is treated 
more or less diflercntly at each period. The individual may also look 
upon himself as belonging to a particular generation — he may be a 
member of the “older generation,” the “young married set,” the “teen- 
agers,” and so forth. Lven such apparently minor factors as one’s 
hobbies and recreations will in turn alTect the individual’s subsequent 
beliavior. Psychological membership in many new groups may result 
from a newly developed interest in bowling, stamp collecting, or early 
American pressed glass. The number of behavioral groupings could 
easily be multiplied. These examples will sullice to illustrate the nature 
of such groupings and their cifcct upon the individual. 

TITH MTANINCS OF INniVTOUALITV 

The individual may be regarded partly as a resultant of his multiple 
group memberships. To be sure, each individual also undergoes expe- 
riences which are absolutely unique to himself. Such experiences are 
probably less significant, however, in shaping the more basic aspects 
of his personality than is his shared behavior. The experiences which 
are common to a group of individuals have a certain degree of perma- 
nence in the sense that they will tend to be repeated more often and 
to be corroborated or reenforced by other similar experiences. In 
general, the more highly organized the group, the more consistent and 
systematic will be the experiences which its members undergo. This 
will tend to make the sh .red experiences on the whole more eflcctivc 
than the purely individual. Moreover, even the individual’s idiosyn- 
cratic experiences will generally have certain cultural features which 
differentiate them from the idiosvncratic experiences of persons in 
other cultures. Thus an individual may compose a poem which is 
unique in its totality and to this extent unlike any poem ever written 
by any other person; but the fact that the poem is a political satire, 
that it is composed in the English language and in iambic pentameter, 
and that it is written with a bail-point pen are among the many dis- 
tinctly cultural features of such an activity. 

In view of the pronounced effect of such shared or common be- 
havior upon the individual’s development, it may appear surprising 
that individuals are no more alike in their behavior repertoire than 
we ordinarily find them to be. The extent of individual differences 
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within any one group is extremely large. In fact, the variations among 
individuals have always proved to be more marked than the differences 
from one group to another. How can the “individuality” of each per- 
son be explained in terms of his shared experiential background? 

The key to this problem seems to lie in the multiplicity of overlap- 
ping groups with which the individual may be bchaviorally identified. 
The number of such groups is so great that the specific combination 
is unique for each individual. Not only does this furnish a stimula- 
tional basis for the existence of wide individual differences, but it also 
suggests a mechanism whereby the individual may “rise above” his 
group. There are many examples of individuals who have broken 
away from the customs and traditional ways of acting of their group. 
Through such situations, modifications of the group itself may also be 
effected. 

in these cases the individual is not reacting contrary to his past 
experience, as might at first appear. This would be psychologically 
impossible. His behavior is the result of psychological membership in 
various conflicting groups. Many group memberships can exist side 
by side in a composite behavioral adjustment. But in certain cases two 
or more groups may foster different ways of reacting to \kiQ same sit- 
uation. This enables the individual to become aware of the arbitrari- 
ness of the restrictions and traditions of each group, to evaluate them 
critically, and to regard them more “objectively.” Membership in 
many diverse groups frees the individual from the intellectual and 
other limitations of each group and makes possible the fullest develop- 
ment of “individuality.” 
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and behavior development, 158f. 
and individual dilTerenccs in behavior, 
374fr. 

Chance, concept of, 63f. 


Chapin Living-Room Equipment Scale, 
801 

Character traits, 

distiibiilion of, 86ff. 
of gifted children, 6O0f. 
sex differences in, 668ff. 
sibling correlations in, 322 
Cheating tests, distribution of scores 
in, 86 

“Child prodigies.” 59511. 

Child psychology, and culture, 8481f. 
Child-rearing practices, 

cuitu.al differences in, 129, 164, 180f. 
in ! elation to “national chaiactci,” 
777 

socio-economic dilTcrenccs in, 792f. 
Childhootl of eminent men, 59111. 
Chimpan/ees, reared in human environ- 
ment, 1731f. 

Chinese, 704, 739 
C’hiiognomy, 382 
Chiomosomes, J02fT. 

and sex diflert*nces, 10*^1., 631, 63'^ f. 
Coaching, on psychological tests, 129, 
200f., 218 

CoefTiciciit ol variation, OOff., 2061. 

Color blindness, 1051. 

sex differences m, 647 
Coior tciminology, and cull^ire, 857 
“Ccimmon” ti.uls, 524fT. 

Community-ccnti ism. 838 
Comparable scores, 459fT. 

Coneept Masteiy Test, 602 
Conditioning, 
in neonate, 153 
prenatal, 153, 155f. 

C’onsonaiHc, cfl'ecl of experience on, 845 
C’onstiliitional types, ,S’( Type iheoiies 
('onveisatioi)s, scs diflerences m, 665f. 
Convexity of piotile, 380f. 
Coopeiativcncss tests, distribution of 
scores in, 87 

Correlation coeflicicnt, 40f., 304 
and mean intia-puir difleicnee, 329 
and trait variability, 482f. 
effect of hvteiogcneity on, 507f. 
Correlation ratio (era), 761 
Co-tw'in control, method of, 164, 1751T., 
347f. 

Cranial capacity, 

and intelligence, 375ff. 
and race classification, 693 
Cretinism, 395, 549 
Clime, 

and “national charactei ” 777 
and race, 707f. 

Criteiion, 45 
Critical ratio, 61 6f. 



Cross-comparison, of cuJtuial and bio- 
logical groups, 130, 747, 764ff., 
772ff. 

Cross-sectional studies, 267f. 

Cultural diffeiences, 

and age comparisons, 268, 278 
and race difTerences, 747fl. 
color terminology, 857 
“human nature,” 859(T. 
in abnormality, 566tT. 
in infant-rearing practices, 129, 164, 
180f., 777 

linguistic categories, 85 
male and female personality, 640ff, 
“Cultural dilTerentials” in intelligence 
test items, 8291. 

Cultural 1 actors, 

icsthctic prclercnccs, 8440". 
children’s concepts, 84Stf. 
childicn’s drawings, 85 Ilf. 
color classications, 8 ‘>7 
concept ot intelligenv c, 488, 7401T. 
developmental “stages,” 84811’. 
emotional development, 850f. 
emotional expression, 843 
gcsuiie patteins, 77"tr., 843 
“human nature,” 8590'. 
intelligence tests, 73311., 740fT, 
language, 8 ^'^11. 
memoiy, 84 If. 
moli\alion, 859f. 
motor habits, 842f. 
musiLcil pi cfei cnees, 845fr, 
percepiion, 839tl. 
pcisonaJity, 771tT. 
psyehologieal geneializat’ons, 818 
race dilfererices, 73311'., 747. 77 Iff, 
scienee, 848 

sex diffeicnccs, 62jf., 637ff, 649, 655, 
677f.. 860f. 
siiacc concepts, 849f, 
tiinc eonecpls. 8391, 
wiu'd-associalion, 842 
Cultural flames of reference, 838ff. 
Cultural gioiips, cross-conmai is<ms with 
biological groups, 130, 747, 76411., 
7721f' 

“rultuie-frce” intelligence tests, 726 
Curve of enor, 64 
Cycloid, 427 

Deaf-mutes, 409f. 

Deafness, Sec Auditory handicaps 
Decline of abilities with age, 28 3 IT. 
Dementia Piudcox, See Schizophrenia 
Dental caiics, and intelligence, 392 
Developmental acceleration, 
of gifted children, 599 
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Developmental accelei alion — Cotit’d 
of girls, 632ff. 

language development and, 652 
manual dextciily and, 649 
play activities and, 649 
Developmental “stages,” and culture, 
84811'. 

Developmental study of behavior, 128, 
14311. 

and stuictural coi relates, 157ff. 
human Ictus, 15 Iff. 
human in I ants, 153ff. 
infrahiiman subjects, 146ff. 
sctiiicnlial patterning m, 147ff., 154f,, 
156f. 

Differential Aptitude 4 csts, 462 
Differential psycliology, 
content of, 3ff. 
current trends m, 24 
caily publications on, I?f. 
histoiical development (4. 5ff. 
objectives of, 4L, 837 
Difficulty level of lest, 

effect on distribution cuiwc, 69f. 
effect on growth curve. 270f. 

Dinarie, 699 f. 

Distance, coneepts of, 84()f 
Distribution cuivcs, examples of, 71, 
7^ff 

asceiidance-submission. 85 
autonomic baKince, 77 
cancciJation, 79 
charactei tests, 86ff. 
height, 76 

imelligence test scoies, 80ff. 
ini i oversion-cxti over sion, 85 
learning, 80 
in o to 1 is ts ’ beh a V i o 1 . 75 
muscuiai tension, 78 
racing capacity, 971. 
visual acuity, 71 
vital »^apaeilv 76 

Distiibiil.on curves, factors influencing, 
66ff’. 

Distribution of individual differences, 

6()ff. 

Dominance, Sec Aggressiveness, Aseend- 
an*. e-subinission 

Domiiuuii-iecessive factors, 105, 

307f. 

Drawings, by childicii, 85 Iff. 

Dysplastjc type, 427 

Fetomorphy, 446f. 

Education, 

effects of special programs, 218ff. 
factor patterns and, 
intellectual decline and, 286f. 
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Fducation — C out d 

intelligence test perfoimance and, 
2^8f 728 
picschool 224jft 

race diffeiencts and 727ff 7‘>8f 77()f 
recognition of indnidinl diflcrenccs 
in 7 

region il diflFercnces and 7^61 
rui il arc is 8171 
sex differences in 623 
socio economic level ind 793 
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349 
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rural 815 823 
sex differences and 612 623f 
tqu ff units See Ineciudity ol units 
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Fslrogen, 631 
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Examiner effect of race of, 726f 
Experimental neuroses m animals, 569f 
r xperimental psychology, 

effect on diffeitnli il psychology, 9 
rise of, 9 

Lye coloi, and i \cc 693 

r aci il chai iclcnslies 
and behavior 38()fl 
in riciil classihe uion, 693 
FaUoi in ilysis 50111 
ipplieitions of 50811 
body build 43 If 
centroid mclhod 301 
ediieitionil lehievcmcnt 511 
infi ihum m gioups 319f 
intelligence 4961 
limit it ions of 5060 
oblique ixcs 303f 
oithogonil ixes 303 303 
person ilitv 3">0fl 
lotitjon ol i\cs SO Iff 
second ordci f iclois sOo 
Sensory md motor functions 309f 
‘ simple slnielLiu 3041 
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ti 1111111 ;^ experiments on 218ff 333 

voeitionil idjuslnient of, 3 4if 
rccblemindedness 34311. 
umona twins 334 
birth 111)111 les md 548f 
bodily dimensions and 384ff. 
tlinie il viiitties of 348ft 
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definitions of, 545 f. 

EEG and, 379 
“familial type,” 550 
health and, 554 
heredity and, 550fT. 
hierarchy of abilities in, 552f. 
incidence of, 547f. 
levels of, 546f. 
nutritional status and, 399 
sex differences in, 625f. 
undifiTereniiatcd, 550ff. 

Pels Research Institute, longitudinal 
studies by, 268f. 

Feral man, 129, 164, i82ff. 
critical discussion of, 183f. 

Fetal stage, 148 

behavior duiing, 15 Iff. 
brain reactions during, 159 
learning duiing, 153 

Fiji Test of General Ability, 726 

Follow-up studies, i>ec Longitudinal 
studies 

Foster children, 130. 327, 34811. 
adult achievement of, 348ff. 
and the nalure-nuituic question, 350ff. 
evaluation of lesearch on, 3611. 
family relationships of, 349f. 
of feebleminded mothers, 35911. 
resemblance to foster parents, 35 Iff. 
retests on, 355, 357f, 360 
social adjustment of, 348ff. 

“Freaks.” 383 

Free association tests, 10, 22 
occupational differences in, 842 

Flench, 700, 76‘'ff'. 

Frequency distribution, 601., aho 

Distribution ciiives 

Frequency polygon, 61 f. 

Functional characteristics, concept of, 
120ff. 

Functional disorder, 560 

Gene frequency analysis, 305fT. 

General lactoi, 4931. 

Genes, lU2ff. 

agents producing changes in, 11 If. 

Genic balance, 107 

Genius, 310ff., 576ff. 
birth order and, 590f. 
definitions of, ^76f., 606f. 
eminence and, 576, 60c»l. 
family background of, 586ff. 
insanity and, 578ff., 589r„ 593f. 
methods for studying, 584ff. 
personality characteristics of, 594 
sex differences and. 621 IT., 6251. 
statistical surveys of. 586ff. 
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Genius — Cont'd 
theories on, 577ff. 

See also Superior children 
Genotype, 306 

Germans, 700, 719, 765f., 768f. 

Germinal stage, 148 
Gesture, 747, 777ff., 843 
“Gifted” animals, 93f, 

Gilted children. See Superior children 
Glutamic acid, psychological effects of, 
401 

Goodenough Draw-a-Man Test, 738, 
741, 75()f., 804, 806, 814, 850f., 853 
Group, psychological concept of, 861ff. 
Group differences, 539ff'. 

Group factor, 494, 496ff. 

Ciroiip testing, 18ff. 

Giowth, 265 ff. 
age of cessation of, 282 
and icurnmg, 278 
sex clilfeicnccs in, 632ff 
Growth curves, 265ff. 

and age progress curves, 278 
and learning curves, 278 
composite nature of, 27411. 
height, 26(>f., 269f., 274, 386ff. 
individual diflerences in, 282 
intelligence test scores, 279ff. 
mechanical aptitudes, 277 
methodological problems, 265ff, 
of infants, 273, 276, 280 
of pienaial behavioi, 275 
specificity ot. 274tt., 287ff. 
weight, 386ff. 

Gypsy children, 8J If. 

Hair color, 

and personality, 381 
and race, 693 
Flair texture, and race, 693 
Handwriting, sex differences in, 663 
“Huid-ol-hcanng,” definition ol, 409 
Harvaid Growth Studies, 38, 2681., 280 
Health, 

and IQ, 391 

of feebleminded subjects, 399, 554 
ol gifted children, 599f., 604 
Fleight, 

and intelligence, 384ff., 600 
and race classification, 693 
distribution ol, 76 

growth cuives of, 266f., 2691’., 274, 
386ff. 

sex differences in, 63 Iff. 

Height-weight ratio, 437ff. 

Heredity, 

and behavior, 121f. 

family resemblance and, 118f., 305ff. 
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Heredity — Coni' d 
feeblemindedness and, 550ff. 
mechanism of, 1051!’. 
methods for study of, 127fF. 
muhiplc'f actor, 106f., 308 
nature of, 1020*. 

popular misconceptions regarding, 

psychoses and, 561 ff. 
race differences and, 782f. 
relation to environment, 112ff. 
unit-factor, 105 f., 307 f. 

Heterogeneity, effect on correlation, 
507f. 

Hierarchy of correlation coefl'icicnts, 499 
“Higher mental processes,’* culuiial fac- 
tors in, 740 
Hindu, 698 
Histogram. 61 f. 

Hisloriomctry, in studv of genius, 585, 
59 Iff. 

Homeosta'.is, 396f. 

sex differences in, 636f. 

Hookworm, and intelligence. 394f. 
Hoimonc, Sec bndocrinc glands 
Hybrid, See Race mixture 
Hydrocephaly, 549 
Hysteria, 563f. 

Tdiot, 546f. 

Idiol savant, 472ff. 

Illegitimate children, adoption of, 352 
Imagery ivpcs, 421 
Imbecile, 5461'. 

Immigrant groups, 703tT., 707f., 718. 
7640., 776 

*'lndiv]diiar’ trails. 524tf. 

Individuality, souiees of, 863f. 
Individuation, in behavior development, 
1471, 

Indo- Australian, 698f, 

Ineeiuahiv ol units, 
convened scoies, 460f. 
etfeel on distiibulion eurves, 7 Iff. 
growl h curves, 272 
practice experiments, 205 f. 

Infant bcliavioi, 

and infani-icaiing practices, 129, 164, 
ISUf. 

development of, 153iT., 273, 275(T, 280 
effect of institutional environment on, 
362fr. 

experimental restiiction of, 164, 179f. 
in rural groups, 817 
methods for studying, 17, 145f. 
of pre-term and post-term infants, 154 
tests on Negro childicn, 732 
training experiments on, 164, H'^ff 


Infrahuman organisms, 
abnormality in, 17 If., 569f. 
behavior development in, 146ff., 157f. 
experimental alteration of behavior in, 
165ff. 

factorial studies on, 519f. 
family lesemb lances in, 323 
individual differences among, 93ff. 
learning performance of, 94ff. 
pienatal alteration of structure in, 
108ff. 

reared in human environment, 173ff. 
recognition of individual dillerenccs 
by, 3 

sex differences in, 629f. 
sexual behavior in, 171 f. 
superior ability among, 93f. 

Insanity, 

genius and, 578ff., 5R9f.. 503f. 
historical concepts of, 544f. 
in families of supciior childien. 599 
race and. 7071.. 733 
socio-economic level and, 733 
See also Abnormality, Psychoses, Sub- 
normal deviant 
Instinct, 126, 165 
and culture. 859f. 

Institutional envi:onnicnts. 218, 362ff. 
effect on intelligence, 36^ U)5fr. 
effect on pel sona lily, 365(1. 

See also Oiphanage children 
Ini dligencc, 

amount of education and, 235if., 238f. 
bodily dimensions and, 384ff. 
cianial eai>acity and. 37511. 
cultuial concept of, 48g 
distribution of, SOlf. 
effect of institutional enviionmcnt on, 
36:ff. 

foster cliildren, 348ff. 

health and, 389tf. 

isolated groups, 8100 *. 

migrants, 76111, 82 Ilf. 

nature of, 488, 492ff. 

nutritional stains and, 398fT. 

orphanage children, 362ff. 

physical type and, 437lf., 441 ff. 

preschool atlcndanec and, 224rF. 

regional differences in, 756ff., 815ff. 

schooling and, 2l7ff. 

sensoiy handicaps and, 406ff. 

sex differences in, 649ff. 

socio-economic level and, 797ff., 829f. 

twins, 335ff. 

type of neuiosis and, 565 
urban-rural differences in, 815ff., 825 ff. 
Intelligence quotient, 16f., 33ff. 
age dcciement in, Stiff., 816f. 
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changes in, 254 
constancy of, 255, 292ff. 

interval between retests and, 294 
“overlap” and, 294f. 
regularity of development and, 295f. 
distribution of, 80, 84 
instability at early ages, 25 'Of. 
of eminent men, in childhood, 592f. 
Intelligence tests, 15(1, 

age dilfeicnccs in, 272(1., 279fT. 

cultuie and, 733(1.. 74011., 829f. 

distribution of scores on, 8011. 

efl'ects of coaching on, 2001. 

ellects of language handicap on, 717(1. 

effects ol practice on, 19511., 253 

lor the blind, 40h 

in Scotland, 8211., 820 

inicipretatiori of, 48611. 

migrants, 76111., 821(1. 

of complete popiilal'oris, 82(1. 

older persons, 282(1. 

puberty and, 405 

lace mixture and, 7491T. 

relation to intelligence, I'^Sf. 

I'ole of exaniinot in, 2491 , 726f. 
rural children, 81511 , 825(1. 
schooling and, 217ff.. 235, 238f. 
semantic tiaining and, 221 
sc\ differences and, 6141. 64911. 
super loi children, 585. 59811. 
trait variability and, 4571. 
validation ol, 5 I II., 487 
Interaction of heredity and environment, 
1 1 3ft. 

as related to practice effect. 2131, 
various interpretations of, 1161. 
Inteicolumnar conelation, 4'. ■. 

Interests, 

factorial analysis of, 5221. 
sex didcienccs in. 663(1. 

SOL io-cconornic diffciences in, 795f. 
Internal consistency, 
and reliability, 431. 
and validity, 53f , 55f. 

Tntia-iacial comparisons, 747 
Intiovcision-cxti'oversion, 85, 425f. 

sex dillcrenccs in, 6741. 

Inventors, study of, 582 
Inverted factor analysis, "^111 
Isolated groups, studies of, 810(1. 
Isolation amentia, 189 
Italians, 6951., 700, 704, 718ff., 730, 739, 
766, 768f., 77811. 
hem difficulty, 

socio-economic differences in. 82811. 
iirban-riiial differences in, 82511. 

S'r also Inequality of units, Scaling 
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J-curve, 65, 74f. 

Japanese, 704, 72411., 738f. 

Jews, 

investigations on, 695f,, 72()f., 733, 739, 
7661., 778(1. 

racial composition of, 701 
Jukes, 3131. 

Juvenile authors, 489, 596 

Kallikaks, 314ff. 

Kaspar Hauser. 18711. 

Kuder Preference Record, 468f. 

sex differences on, 666 
Kuhlrnann-Bmet, 17 

Ladogan, 6991. 

Language, 

among “wild children,” 182, 185f., 
187f. 

and concepis, 8 ‘>5(1. 
and Inc I(J. 254 
cultural differences in, ^‘'flf. 
effects on behavior, 85511 
in the concept of feeblemindedness 
5451’. 

response to. by chiinpan/ees, 17^11. 
training expenmenis on inlarUs, (76 
Language development, 

and developmental accelcialion of 
girls 652 

effect of auditory handicap.-^ on, 409(1. 
only children, 3 36 
orpliariagc children, 367 
sex differences m, 65|ff. 
socio-economic diffciences in, 793 
t^Mns, 335(1. 

See also Verbal aplitude 
Langiiaizc handicap, 71711. 

American Indian, 722(1. 
bilingual American, 718(1. 

Irish, 7211. 

.lapanesc, 7?4f. 

Welsh. 7201. 

Lappish, (»99 
Latah, 568 

“I atm” race, 700, 771 
Learning, 

and giowth, 2651., 278, 296(1. 
and iTialii^alion, 1241. 
expciimcnts on, 165(1., 173(1., 

1791. 

factorial analysis of, 508 
in adults, 2891'. 
in feebleminded, 553f. 
in inirabuman organisms, 94ff. 
Learning curves. 210, 278 
Leplosome, 426f. 

Levels of confidence, in statistics, 61Cf. 



888 Subject Index 

“Lightning calculators,” 470ff., 475f. 
Linguistic categories, and race, 700f. 
Longitudinal studies, 
and statistical legiession, 241 IT. 
comparability of tests in, 25111. 
foster children, 357ft. 
gifted children, 585, 601 ft. 
growth, 267ft., 280f. 
methodological pioblcms in, 239f. 
preschool children, 227L 
selective factois in. 231, 239, 241 
sex diftei cnees in IQ, 629 
“Lump scores” on intelligence tests, 
457f., 621 

Malaysian, 698f. 

Malnutrition, psychological effects of, 
398fl'. 

Manic-depressive p.sychnsis, 558, 563 
Marital adjustment, of gifted subjects, 
6()4f. 

Maseulinitv-femininity index (M-F), 
6781T. 

cultural factors and, 68<)ft. 
occuinilional diftei cnees in, 681 f. 
physical chaiactensiics and. 680, 682 
specificity of group diftei ences in, 680 
Matched group studies, 

a posicnott matching in, 240f. 
and statistical regiession, 247 (T 
meihodoUigical problems in, 2391T. 
role of examiner in, 249f. 

Maternal diive, and culluie, 859f. 
Malhematical ability, 
organization of, 511. 527 
sex differences in, 660 
iS.'V also Numerical apliliule 
Malm at ion, 

111 behavior development, 124f., 154ff., 
165ft\, 265f. 

sex d ill ei ences in rale of, 632tT. 
Mechanical aptitude, 473, 484f. 
sex difterenccs in, 632, 655tr. 

Sec also Spatial aptitude 
Mediterranean racial group, 698, 700f., 
765, 7f.7ft., 772fr. 

Melanesian, 698 f. 

Memory, 1931., 475 
effect of cultural factors on, 841 f. 
sex differences in, 654 
Menioiy span, tiaining in, 193f. 
Mcnarcheal age. See Puberty 
Mental age, 16, 33f., 459 
in growth curves, 272 
Mental imagery, lOf. 421 
Mental Measurements Yeaibook, 24 
Mental set, in test administration, 250 
Mental tests, llff., 29ff. 


Merrill-Palmer Scale, 17 
Mesomoiphy, 446f. 

Methodological problems, 
group difieicnccs, 613ff. 
longitudinal studies, 239ff. 
race diffei ences, 689ft., 713ff. 
schooling studies, 239ff. 
sex diftcronces. 61 3ft. 

Microcephaly, 375, 549 
Muldh’tmvu, 788 

Midwest^ 803ft, Sec also P tabic City 
Migration, 747, 7560'.. 760ft., 821ft. 
Minncfola Home Status Index, 802 
Minnesota Miiltiphasic Personality In- 
ventory, 467, 795 
Minnc.sofa Preschool Tests, 17 
Miscegenation, See Race mixture 
Mitosis, 103 

Mongolian race, 698f., Sec also Chinese, 
Japanese 
Mongolism. 548 

Mongoloid race, 698f., See also Ameri- 
can Indian, Chinese, Japanese 
Monsters, experimental production of, 
109f. 

Moron, 546f. 

Morphologic index, 423 
Motivation, 

and race diftei ences, 734 
cultural diftei ences in. 859f. 
elTect on i^telll}J^'ncc test performance, 
734 

Motor abilities, 
factor'ia) analysis of, 509f. 
sex diftei ences in, 648f. 

Motor habits, and culture, 842f. 
Mountain children, studies of, 812ff. 
Multimodal disti ibution, 66, 68f., 428f. 
Multiple 1 ’actor theory, 496ft‘. 

Muscular reactivity, sex differences in, 
632 

Muscular tension, disli ibution of, 78 
Musical aptitude, 469f.. 474f., 483ft., 497 
race diftci ences in, 716f. 
sex differences in, 659f. 

Musical taste, cultural factors in, 845ff. 

Naive observer, in art, 844f, 

“National character,” 775ff., 787 
National groups, 700f., 764ff., 775ff. 
Negro, 698f., 706f., 717, 725ff., 727ff., 
729ff., 734, 736f., 751ff., 757ff., 771f., 
792, 821f. 

Negroid race, 698f. 

Neonate, 144, 153 

Nervous system, and behavior develop- 
ment, ]57ff. 

Neurasthenia, 564 



Neuroses, 563ff. 

among native African troops, 568 
and constitutional type, 43 3f. 
relation to intelligence, 565 
Neuroticism, 

&c\ diflcrences m, 675if 
socio-economic differences in 79'^ff 
Nordic, 689t., 698, 700f , 765, 76711., 
772a 

None 699f. 

Noiinal piobability curve 62ft 
and hcredit\, 106, 308 
in test construction, 881 
Normative developmental studies, 128, 
14311. 

Norms 

concept of, 311 
specific ity 371T 

Numeiieil aptitude, TOft , 47^f 48*^1 
sex; dillerenees in 6^71 
Nurseiy school Sw Pi csehool attendance, 
riesehool testing 

Obsticle sense of the blind 408 
Occupational abilit> patterns, 465ff, 

511 

Oecupitionil achievements 
correlates of success in, 60‘>f 
feebleminded 5541T. 
gifted men 60^ 
giltcd women, 604, 624 
Occupations 

free association icsponses and. 842 
intelligence and 7971 
M r index ind, 6SI1. 
pitcrml and child intelligence, 8001 
prcfcnecl aiiditniy tempo and 84511 
piimitive eiillLiKS 63"ft 
S( < aho Soeio economic level 
Old Cjt}, 7S8, 790f 
One loom schools, 23*^ 815 

and intcllu’cnec Psi peifomnnce 817 
OiiK childien, language dcv^lupmcnt of 
338 

Organic disordci, 560 
Orphan igc child! en 36211 

and Iceblcmindcd women, 364f 
intellectual inlcnoritv of 3631 
langu 1 "'" dmeloppient ot, 367 
ncgatJMsm in 366 
preschool altcndancc, 2291, 364 
regression in, 366 

Overlapping of distributions, 98, 285f , 
618ff, 760, 808 

Oxygen deprivation, effects ol, 396 

Pantomime, in testing procfdurc, 725 
Paranoia, 557f. 
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Paient-child correlations, 318fF., 351, 

353, 356, 358, .362f. 

“Partially seeing,” dehnition of, 407 
Pathological concept ot abnormality, 
5411 

Pathological conditions, effect on distri- 
bution curves, 731 

Pathological iheoncs of genius, 578ff. 
cntic]uc of 5 KOI. 

Pedigiec studies, See Family history 
method 

Peiccntilc scoics, 35f, 65f, 459f. 
Peiception, 

cultuial factois in, 839ff 
effects of isolation on, I88 l 
lactoiiil analysis of, 5081. 
sex dilfcicnccs m, 6481 
Pcifoim met sc des 171 
sex diftcicnccs in, 657 
Persistence tests, 

constitu lonal tvpe and, 441f. 
distnbution ol scoics on, 87 
race and 7721 
Personal eciualion, 7f, 249 
Pci son ilily, 
age anil 290 
blindness .nd 4081 
blood chemistrv and 396f 
cultuie and 64011, 747, 771ff, 850f, 
S59f 

deafness and 409ff 
disordcis ol, 5*570 , 563fT 
distiibLilicm cui-vcs ol, 85jT 
IhCf and 376f. 

cllects of inslilutmnal cmvuonmcnl on, 
^6511 

effects of stanation on, 401 f 
1 ici d cli 'I (Llci islics and, 380ff 
1 iclorial ciiiilvsis ot, 510. 520ff 
family resemblances m, 309, 319f, 

3221 , 3 ^011 
gciiiis anti •'93f 
gl indiil II del Lts ant, 395f. 
ban coloi and, 381 

phssiqiie and 42lff, 4'lff., 4'^81 , 
441 IT 4490 

psvcl’crstim itic disordcis and, 397f 
pnb(lt^ ind 405 
race .rd, 7n, 771 ff 
sensory handicaps and, 406ff 
sex diftcrcnces in, 640ff , 663ff 
social clas" and, 79011 
supenoi children, 600f , 604 
tests ol, 22ft , 541f. 

Pei version, concept of, 859 
Phenotype, 305f, 

Phenylpyiuvic amcnlia, 549 
Photographs judging traits Irom, 382 
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Phrenology, 374f. 

Physical defects, 389fT. 

sex differences in incidence of, 634ff. 
Physiognomy, 38()f. 

Physiological factors, 
and intelligence, 388ff. 
in sex differences, 63 Iff., 677 
Pictures, use in testing, 72^f. 

Pignet index, 442 
Pilot selection battery, 50f. 
Pintner-Paterson Scale, IS, 722f., 736f., 
750, 753, 762ff., 768f., 814, 817f.. 
819, 825 

Plmmille, U. S. A., 788 
Play activities, 

gifted children, 600 
Negro children, 733 
sex differences in, 640. 664f., 673 
socio-economic differences in, 793 
urban-rural differences in, 815 
Polynesian, 698t. 

Populatron, 
defimliori of, 615 
testing of complete, 82ff. 

Potentiality, Sec Capacity 

J*olla1ch, 860 

Practice, 

effects of, 129, 193ff. 
growth and, 26'!f., 278 
heredilv-environmcnt problem and, 
211fL 

intelligence tests and, 195ff. 
variabrlily and. 202ff., 208ff. 

Prairie City, 788, 794, 803, Sc<t' ul'>o 
Phntown, Midwest 
Prejudice, 690 
J'renatal behavior, 11 Of. 

age changes in, 146(1', l.'^llT., 274f. 
human siibiccts, 15 Iff. 
infrahuman subjects, 146tr. 
kaimng in, l'^3, 155f. 
methods (or studying, J44r, 

Prc-piiberial grovvlli spurt, 269f. 
Preschool attendance, 

and emolional changes, 250f. 
effects on inleiligciice, 218, 224ff. 

See also Schooling 
Preschool testing, 
instruments lor, 17 
negativism in, 251 
predictive value of. 253ff., 293f, 
seasonal variations in, 2*)6l. 

“Primary Mental Abilities,” 
and factor analysis, 497 
sex differences in, 65 8 f. 
socio-economic differences in. 803ff. 
tests of, 461 

Probability, statistical, 62ff., 6l6f. 


Profile chart, 
definition of, 459 
examples of, 46 Iff.. 467f. 
methods of plotting, 459ff. 

Projective techniques, 23f. 

P.sychasthenia, 563 

Psychoanalytic theories of genius, 581f. 

Psychograph, See Profile chart 

Psychological testing, 29ff. 

Psychoneuroses, See Neuroses 

Psychoses, 557ff. 

constitutional type and, 43 Iff. 
heredity and, 56 Iff. 
intcllcclual level and, 558 
organic versus functional, 560f. 

See also [nsanity 

Psychosomatic disorders, 397f., 41 3f. 

Puberty, 

developmental rate, 269t., 403 
intelligence and, 405 
onset in gifted children, 599 
personality changes. 405 
sex dillcrencc's in onset of, 633 

“Pure types,’’ studies on, 430, 440ff. 

Pygmy Black, 698f. 

Pyknic, 426 

Quadruplets, 340 

Qualrlalivc dillcienccs, 59^, 582f. 

‘‘Qualitalive“SupeMoiJl>” theory id' genius, 
582t. 

“Ouanlilalive-supciiorjtv’' theory of 
genius, 5831. 

Quintuplets, 336t., 339f. 

Race, 

classification of, 692ff., b97fl. 
ctileiia of, 692(1. 
dclimtion of, 692 

Race diilci cnees, 689ff. 

Clime and insanity. 707 f.. 733 
cultural achievements, 7I4ft. 
evaluation ol, 78111. 
heredity and, 782f. 
language handicap and, 7l7ff. 
methodological problems, 690f., 692ff., 
7I3ff. 

musical aptitude, 717 
peisonality, 733 
play activities, 733 
school attendance, 759 
schooling and, 727ff., 770f. 
sensory acuity, 716 
socio-economic level and, 729ff. 
specificity of, 73811. 
surveys of data, 69 If. 
theories regarding, 689f. 
verms cultural differences, 746ff. 



Race mixture, 70 Iff., 747ff. 
achievement and, 702f. 
incidence of high IQ and, 75 3f. 
physique and, 70 If. 
test peifoimancc and. 749ff. 

Racial classification, 692ff., 697ff. 
evaluation of, 695ff. 
linguistic groups and, 700f. 
national groups and, 700f. 
race mixture and, 701 
Racing capacity, distiibutioii of, 97f. 
Range, efl'ccl on 

correlation coefTicicnts, 507 
distribution ciiives, 69 (. 
reliability coeflicierits, 44f. 

Rapport, 31, 7261. 

Ratings, in sex diHercncc studies. 620 
Rational equivalence, method ol, 44 
Reading disabilities, sex dilTeienccs in, 
652 

Reasoning, factorial analysis of, 497, 509 
Rectangular distribution 65f. 

Reduction division, 1031. 

Reflex, 1251. 

Regional differences, 747, 756ff. 
fill rope, 8191 

United States, 727f., 756IT , 794, 81511. 
Regression, statistical, 2410. 
eiior of measurement, 243 
giuup comp<iMSC)ns, 247(T. 
individual comparisons, 242IT. 
leveling and, 24411, 
picschool studies, 244 
test I el lability, 242fT. 

Relative vai lability, See VaiiabiLly 
Reliability, 

concept of, 39f. 
of ditferences, 615fF. 
of slalistical measures, 615fT. 
of tests, 39fT. 

and regression effect, 242fT. 

Reliability coeflicient, 401. 
behavior, il fliK lualions, 4 If. 
ctlect of range on, 44f. 
internal consistency, 43f, 
long -1 ange prediction, 255 
Repetition, 

elTects on intelligence tests, 195fr., 

253 

qualitative effects of, 1971’. 

Reversion, concept of, 859 
Rh factoi, in fecbleminilcdncss, 549f. 
Rigidity of behavior. 553 
Roles, specialization among twnii, 339f. 
Rorschach Test, 23, 776 
Royal families, hemophilia in, 106 
Ruial, See Urban-rural diffeiences 
Rural schools, 235, 815, 81 7f. 
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Sampling error, 61511. 
factor's influencing, 617f. 

Sampling problems, 

group comparisons, 613ff. 
race drffercnccs, 7()3ff., 7061f. 
schooling studies, 239ff. 

Sampirng theory, 4941f. 

Scaling, 826, See also T-scores 

Scatter, See Trait variabilrty 

Schrzoid, 427, 561 

Schizc^phrenia, 558 

family studies ol, 56 Iff. 
in twins, 562f. 

Scholastic Aptitude Test, 653, 657f. 

Scholasticism, 6 

Schooling. 

ciitiqiic of studies on, 219, 222ff,, 23"^, 
237f., 2390., 257ff. 
cfl'ccts of, 129, 2i71f. 
race differences and, 727(T, 770f, 
relation to lieiedilv-cnvironment ques- 
tion, 2570. 

retarded childien, 218ff. 
luial childien, 235 

Screening, use of tests in, 47ff. 

Seashoie Mcasuies of Musical Talent, 
470. 483, 659. 717 

Seasonal variation in preschool IQ’s, 
25 6 f. 

Sclcctnc bleeding, 128, 136fl. 

Selective raclors, 
adoption, 352, 356f. 
college cniollincnt, 653, 775 
coinp.u icons of social classes, 794 
cross-sectional studies, 267t., 284f. 
gioup comparisons. 706fl. 

immigration, 704, 760ff. 
in.stitutionalizatiun, 388, 625f., 708 
longitudmai studies, 231, 239, 241, 
268 

matched-group studies, 241 
orphanage populations, 363 
race mixture, 748 
racial cimipaiisons, /06fT. 
schools lor the blind, 408 
sex cliflerences, 6l3ff.. 62jf., 650 
sexual behaMor studies, 792 
sibling correlations, 320f. 
lest no ms, 2821'. 
twin studies, 330, 345f. 

2alcctive migration, 76orf., 821 ff. 
Semantics, and intelligence test per* 
fotinance, 221 
vScmitic. 700 
Senescence, 298f. 

Sensory capacities, 

factorial analysis of, 510 
of blind subjects, 408 
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Sensory capacities — Coni d 
race ciifteienLCS in 716 
stx differences m 647f 
Sensory handicaps 40‘>ff 
Scqutnliil pittcrmng of bth'^vior dcvtl 
opment 144 147ff, 1541, 156f 
Sex diflercnccs 612ff 
achievement 621 IT 
birth rate 6 IS 
dcvelopnunl il rite 6'12fr 
cducUionil ichicvtnient 6600* 
fcebltnundcdncss 62Sf 
heicdity and cnviionmcnl in 612 6"^3f 
incidence of defects 634fl 
incidence ot hii^h IQ 62 1 
infrihumm oij^inisms 62^^f 
intelligence tests 614f 649IT 

mcthodokvic il problems 6l3tT 
niorl ilit> 63411 
musculij icietiMts 632 
person ililv 64011 ( 63il 

phv lologic il 1 ictois 6^17 
lolc of inltiire in 61711 
senson motor functions 647fT 
speci il iptitndcs (SliT 
SLUVL>S ol 61 C47 

tr nl orj, ini/alion ''171 
V nubility 62411 

Sex influf need hctois 106 307 
Sex limited 1 ictors 106 
Sex linked fictois lO'^l 307 
Sex lolts and culture 642 
Sexu il be h i lor 

eulluial f icUrs in 17^ 70^ 843f 860 
in infr ihum in sul jee I 1 1 1 ( U) 

111 wild childien 1 6 
ocio eLfmoimc level ind 792 8 Hf 
Sibnn s 104 320IT 3*^ I 

Siun » stoics Sn St ind lid seoies 
Sunilic \nc( ol i diltcicn (I^IT 
Sim ^ciic C ird ioi SouoFconomic 
St tns 79S hOlf 

Skv^wtd distf ibiilio 1 6ft 67f 691T 
Skin color ind i ue ( 7^10 

Sociil tl ss 7S71T 

iiikli nee ind 79711 8001T 82S1T 

inelh uls ft j studvirip 7S81 
persoinlitv m 7^01 
proportion ol ptisons m e leh 789 
soei il iieispcctivi nitl 7^(1 
Sociil eonstr lints cllccL on distiibution 
cuivcs 74f 

Social expect incv 6231 812 

Soci il orient ition sex tblTerenees m 
6721T 

Socioeconomic level Ik) 7871T 
attitudes ind 79f f 
child rciim^ pi ictiecs nul 792f 


Socjo economic level — Cont d 
education and 793 
genius and 586f 
insanity and 733 

intelligence and, 6031 , 797ff , 800ff , 
82Sft 

in lorei^n countries 808 
mlcipict lUons ot 8091 
mtcicsts ind 7951 
lingiiagc development and 793 
nicisuremeni of SO 111 
ot communities and IQ 8071 
of mmigiant >,ioups 7051 
pciscnality ind 733 79011 
physic il condition ind 3)41 398 412 

lacc diffti cnees and 72911 75Sf 
sexuil bchivioi ind } )2 84 3f 
superior childitn and 59) 794 
Somatotonn 449 
Somuolypc 446f[ 

Spice contcpls of 8401 
Sp itul iptitudc 477 
sex dilTcienccs in 65sfT 
Stc also Mechinicil iptitudc 
Spe iim in Hiov n toimula 43t 
Spcci il ipt iLitles ■’011 4f )11 
in the Iccblcmmdtd 472li 5S2f 
sc\ dillcitnccs in 65 Iff 
twin icscmbline s in 3^0 34('f 
Spcci il Iiainint Units It S 'iiiv 221 
729 

Spcci ili/alum uf ability 4 7ir 
^p cilic 1 clois 493 

Speech clisoidtis sex diOtunccs in 652 
Sp ed 

ind cultuic 7''n 
lictoiiil umIvsis of 497 50S 

m Ustinc oltlci jitrsins 2SS 
ricc ditli ren es 7 ( I 
inbiniui 1 djllcienccs 817f 
Spill liilf icehniquc 43f 
Spuiioii ctnclilion )S 
St ind irtl cic\ I iliun 36 
Slinttud cnoi of 
dilleunce 616f 
cslim lie 46f 
me in 618 
ucoic 44t 

Stindud sc ires 361 460 

St\nd iidi/ ilion ol psychological tests, 
30f 

St inford Binct lOf 

ad ptition foi the blind 40(> 

and piieiitv^l occupilion 80 )f 

dislribution e^i IQs 01 80 84 

effect ol CO ichmg on 200f 

sex differences on 6501 

urb in rural differences on 816 825f 
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Stanford University study ot gifted chd 
drtn 4791 ^9Sft 601ft 624 628f 

eviluuion ot 606 
“Staired men ot science ’ ^86f 

sex difTerenecs in me denec of 622 
Statistic li it>nctpl ot ibnonnalily *i42t 
Statistic il meihodb 11 
Statistic il sur\c>s of emus 5S41 5860" 
(21ft 

Stilus me isuremcnt of 79S 
Slei cotypes '^S2ft 4n 4s2f 620 62S 
63 7 672 777 

Strong Vocational Intt.iest Blink 
faetoiul an il vs IS ol *'‘’21 
occupitionil level scoic (O I ) 706 

sex diffeicnecs on 6(6 
SOCK) econom e dil^ icncts on 70^f 
Stiuctui il eh II iclerisiics concept of 
120tl 

and gro \th 2( ‘^f 
Stuictiir il coiieliles of 

bch iMor dcvekipinent 128 !‘‘7rr 
indivicluil dittcrenccs in beh ivi h HO 
140 142f 373(1 411ft 452f 

Stiuetui il limit itions 

ind ntclkctii il decline 208f 
m behiviu ckxelcpmcnl 26 37 ft 

Subnoimil deviint "^441 
dcmoriologie il view 544 
medic il view ^441 
psycholot ic )l view 545 
Siiptnot ehildien 50 ft 

adiih i hicvtments of (OUT 
coiiclitcs ol success in 605f 
cast studies of 
cducilioii of 501 5 18 (OOft 

fimily bickt^fOiind ( 59:^ 

hcilth incl phvsicd traits c 38 41 
5001 e04 

longitLid n il studies ol 601 IF 

miiJtaJ si itus ot 60Mf 

musie il iptitudc of 469f 

Ncgio 7 31 

oftspi ing ot 605 

pcisonility 597f (OOf 

pliy icliMtis 600 

piolile ehiits of 4(31 

sex rilios inio ig C’Sf 

speeiili/ ition ol ih lilies n 479f 

test surveys ot, 59SfT 

t ratio 6l(f 
r scores 37 461 8'’7 
Tetrad criterion 4991T 
Time, concepts of, 839f 
Time limit method 203 
Time scenes 204f 
Tonsils ind intelli^enee 392f 


Tiainmg, 

and giowth 265f, 278, 296ff 
experiments on anira ds, 165ff, 173ff 
txpciiments on intajnls, 164, 175ff, 
179f 

See aho Piaetiee Schooling 
Tr lit concept ol 492 4081 , 526ff 
Irait oicanization 492tf 
age difteienecs m 513(1 
eduealionvl dilfeienecs in 5l5fl 
elleet of expentnee on 526f 
efteel of pi leticc on 527 
cxpennicnl il ippniich to 5261T 
gioup difteicnces m 5l2ft, 51811 
mil ahum 111 eioups 5191 
methodology ol 499ft 
oceupitonil dilleicnees in 5l8f 
pciscnilitv 520ft 
sex difteienecs in 5l7f 
iheoiics of 49 3fl 
lint \ ijiibilitv 476fT 
anility level md, 47711 
ige md 4S 1 

mteieojielitions in rcl ition to, 

4S2t 

p< isoii ility eh 11 leleiisiies md 481 
pi letice md 481 
Triplets 3 8 
Iropism 125 
liLit dilTeienee 616 
‘liue sec 1 C 45 

Ivvins 1041 130 164 175(1 327fT 

d vckpmcnt ol psve hoses in 562f 
iiiteinil \i}sus identic il 104f 327 

3 ^ft 

jdeiine il idcntilie Ition ot 3281 
inttUcetLiil jnfcnoritv of 3 3 ft 
1 iuiir.e development ot 335(1 
1C lied ipul '‘4011 
leseiiibl mets between 3'’8fT 
soeial inter letion ol 336 f 
Two I letor iheoiy 49'’ft 
I ype i lelors 5 p 
lypc theories 6( 42111 

at, ml 433 43(11 448 

eericUtionil studies on 4^7IT 
hsLoiyol 422ft 
1 le el 42 (1 
ps ehts s md 431ft 
nee and 69-+1 

• udies on puic typib ’ 43li 44(MT 
typology, Sec lypc theories 

Umversity of C aliform i Socio Le moniic 
Index 802 

Unleirned beh ivior, 122fF 165 

I ni elated children correl ition between 
323f 
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Urban-rural differences, 794, 815ff. 
Europe, 819f. 

selective migration and, 82 Iff. 
specificity of, 824fT. 

Validation, 45fT. 

intelligence tests, 5 Iff. 
personality tests, 54fl. 

Validity, concept of, 45 f. 

Validity cocflicient, 46 

Valuational concept of abnormality, 541 

Vanability, 

age and, 273f., 284f. 

effect of practice on, 202/1., 208/1. 

in different trails, 8SilT. 

infiahiiman oiganisins, 93fT. 

iclativc nieasmes of, 900'., 2()6f. 

sex dilTeiences in, 624fT. 

within the individual, 4570. 

Variance, 212, 482 
Veddoid. 698f. 

Veibal aptitude, 4«5f, 487r., 497, 509, 
552f. 

sex dillerenccs in, 6'>ltT. 

Vial)ilily, sox difloioncos in, 6^40'. 
Vineland Soci.il AhiUiiily ,Vaic, 547 
Viscciotonia, 449 
Visual acuity, distribution of, 71 


Visual handicaps, 406ff. 

IQ and, 407E 
personality and, 408f. 
sensory discrimination and, 408 
Vital capacity, 
distribution of, 76 
sex differences in, 632 
Vilaniiiis, psychological effects of, 400tf. 

Wcchsler-Bellcvue Scale, 18. .52, 481, 
514f. 

Weight, 

iiUclljgencc and, 384fl., 600 
sex diffeic rices in. 6.1 iff. 

Whittier Scale foi Giading Home C 
ditions, 599, 801 
Wild Boy ol Aveyron. ISSiT. 

Wild children, .See hcral man 

Windigo psvchosis, 5(iS 

Wolf childicn ol Midnaporc, 186f. 

Woih meilioils, 

j elation to practice 199, 206, 214 
role m individual diOeiences, 179 
trait oigani/.iiiCMi and, 527i. 

Vafikrc Ciiy, 78cSI. 

7-score.'>, See Siaruiaid sv.orcs 



